Overview
Dataset statistics
| Number of variables | 115 |
|---|---|
| Number of observations | 18866 |
| Missing cells | 611074 |
| Missing cells (%) | 28.2% |
| Total size in memory | 16.6 MiB |
| Average record size in memory | 920.0 B |
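The figures in this table can be cross-checked against each other. The sketch below recomputes the missing-cell percentage and the total memory size from the counts above; the variable names are illustrative only, and because the average record size is itself rounded, the memory figure should agree only up to rounding.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 115
n_observations = 18866
missing_cells = 611074
avg_record_bytes = 920.0  # already rounded in the report

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# 1 MiB = 1024 * 1024 bytes.
total_mib = avg_record_bytes * n_observations / (1024 * 1024)

print(f"total cells:   {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"total size:    {total_mib:.1f} MiB")
```

Both results should match the reported 28.2% and 16.6 MiB to one decimal place.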
Variable types
| Text | 115 |
|---|
Dataset
| Description | Vertebrate Zoology Division - Mammalogy, Yale Peabody Museum 0061684-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.shrths |
Alerts
accessRights has constant value "Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj" | Constant |
language has constant value "http://creativecommons.org/publicdomain/zero/1.0/" | Constant |
license has constant value "CC0_1_0" | Constant |
rightsHolder has constant value "Yale Peabody Museum" | Constant |
type has constant value "PhysicalObject" | Constant |
institutionCode has constant value "YPM" | Constant |
collectionCode has constant value "VZ" | Constant |
ownerInstitutionCode has constant value "YPM" | Constant |
basisOfRecord has constant value "PRESERVED_SPECIMEN" | Constant |
dataGeneralizations has constant value "Coordinate data unavailable" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
phylum has constant value "Chordata" | Constant |
class has constant value "Mammalia" | Constant |
nomenclaturalCode has constant value "ICZN" | Constant |
taxonRemarks has constant value "Animals and Plants: Vertebrates - Mammals" | Constant |
datasetKey has constant value "854f602e-f762-11e1-a439-00145eb45e9a" | Constant |
publishingCountry has constant value "US" | Constant |
mediaType has constant value "StillImage" | Constant |
phylumKey has constant value "44" | Constant |
classKey has constant value "359" | Constant |
protocol has constant value "EML" | Constant |
lastCrawled has constant value "2025-01-08T13:41:11.140Z" | Constant |
isSequenced has constant value "false" | Constant |
publishedByGbifRegion has constant value "NORTH_AMERICA" | Constant |
dataGeneralizations has 18800 (99.7%) missing values | Missing |
recordedBy has 4296 (22.8%) missing values | Missing |
sex has 10133 (53.7%) missing values | Missing |
lifeStage has 17963 (95.2%) missing values | Missing |
reproductiveCondition has 16576 (87.9%) missing values | Missing |
behavior has 18864 (> 99.9%) missing values | Missing |
preparations has 349 (1.8%) missing values | Missing |
associatedReferences has 12450 (66.0%) missing values | Missing |
associatedTaxa has 18487 (98.0%) missing values | Missing |
otherCatalogNumbers has 12652 (67.1%) missing values | Missing |
fieldNumber has 11555 (61.2%) missing values | Missing |
eventDate has 6567 (34.8%) missing values | Missing |
startDayOfYear has 7901 (41.9%) missing values | Missing |
endDayOfYear has 7901 (41.9%) missing values | Missing |
year has 6572 (34.8%) missing values | Missing |
month has 7472 (39.6%) missing values | Missing |
day has 7989 (42.3%) missing values | Missing |
habitat has 18739 (99.3%) missing values | Missing |
higherGeography has 3778 (20.0%) missing values | Missing |
continent has 3874 (20.5%) missing values | Missing |
waterBody has 18739 (99.3%) missing values | Missing |
countryCode has 3974 (21.1%) missing values | Missing |
stateProvince has 5347 (28.3%) missing values | Missing |
county has 9192 (48.7%) missing values | Missing |
municipality has 18309 (97.0%) missing values | Missing |
locality has 5869 (31.1%) missing values | Missing |
verbatimElevation has 17391 (92.2%) missing values | Missing |
decimalLatitude has 5543 (29.4%) missing values | Missing |
decimalLongitude has 5543 (29.4%) missing values | Missing |
coordinateUncertaintyInMeters has 5609 (29.7%) missing values | Missing |
georeferencedBy has 18537 (98.3%) missing values | Missing |
georeferencedDate has 10549 (55.9%) missing values | Missing |
georeferenceProtocol has 5610 (29.7%) missing values | Missing |
georeferenceSources has 5615 (29.8%) missing values | Missing |
georeferenceRemarks has 5661 (30.0%) missing values | Missing |
typeStatus has 18844 (99.9%) missing values | Missing |
identifiedBy has 17735 (94.0%) missing values | Missing |
dateIdentified has 17913 (94.9%) missing values | Missing |
identificationRemarks has 18863 (> 99.9%) missing values | Missing |
order has 406 (2.2%) missing values | Missing |
family has 684 (3.6%) missing values | Missing |
genus has 1248 (6.6%) missing values | Missing |
genericName has 1248 (6.6%) missing values | Missing |
specificEpithet has 2554 (13.5%) missing values | Missing |
infraspecificEpithet has 11638 (61.7%) missing values | Missing |
elevation has 17391 (92.2%) missing values | Missing |
elevationAccuracy has 18082 (95.8%) missing values | Missing |
distanceFromCentroidInMeters has 18788 (99.6%) missing values | Missing |
mediaType has 18411 (97.6%) missing values | Missing |
orderKey has 406 (2.2%) missing values | Missing |
familyKey has 684 (3.6%) missing values | Missing |
genusKey has 1248 (6.6%) missing values | Missing |
speciesKey has 2554 (13.5%) missing values | Missing |
species has 2554 (13.5%) missing values | Missing |
repatriated has 3910 (20.7%) missing values | Missing |
gbifRegion has 3929 (20.8%) missing values | Missing |
level0Gid has 5871 (31.1%) missing values | Missing |
level0Name has 5871 (31.1%) missing values | Missing |
level1Gid has 5888 (31.2%) missing values | Missing |
level1Name has 5888 (31.2%) missing values | Missing |
level2Gid has 5935 (31.5%) missing values | Missing |
level2Name has 5935 (31.5%) missing values | Missing |
level3Gid has 16539 (87.7%) missing values | Missing |
level3Name has 16544 (87.7%) missing values | Missing |
iucnRedListCategory has 7581 (40.2%) missing values | Missing |
gbifID has unique values | Unique |
bibliographicCitation has unique values | Unique |
references has unique values | Unique |
dynamicProperties has unique values | Unique |
occurrenceID has unique values | Unique |
catalogNumber has unique values | Unique |
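The three labels in the list above (Constant, Missing, Unique) can be approximated with a few lines of code. The following is a simplified sketch, not the profiler's actual implementation: `column_alerts` is a hypothetical helper, and it assumes missing cells are represented as `None`. It treats a column as constant when its non-missing cells share a single value, which would be consistent with dataGeneralizations being listed as both Constant and Missing.

```python
def column_alerts(values):
    """Return report-style alert labels for one column.

    `values` is a list of cell values; None marks a missing cell.
    """
    present = [v for v in values if v is not None]
    distinct = set(present)
    alerts = []
    # Constant: only one distinct value among the non-missing cells.
    if len(distinct) == 1:
        alerts.append("Constant")
    # Missing: at least one cell has no value.
    if len(present) < len(values):
        alerts.append("Missing")
    # Unique: every row holds a different, non-missing value.
    if values and len(distinct) == len(values):
        alerts.append("Unique")
    return alerts


print(column_alerts(["Coordinate data unavailable", None, None]))
print(column_alerts(["4953409301", "4911830319", "4911830318"]))
```

The first call mimics a sparse constant column, the second an identifier column such as gbifID.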
Reproduction
| Analysis started | 2025-01-08 23:32:52.429790 |
|---|---|
| Analysis finished | 2025-01-08 23:32:54.507993 |
| Duration | 2.08 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
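A report of this kind could be regenerated roughly as follows. This is a minimal sketch that assumes `pandas` and `ydata-profiling` are installed; the input path, the tab separator, and reading every column as a string are assumptions rather than settings taken from this report, and `build_report` is a hypothetical helper name.

```python
def build_report(table_path, out_html="report.html"):
    """Profile a delimited occurrence table and write an HTML report."""
    # Imported lazily so the function can be defined without the libraries.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: a tab-separated download, with all columns kept as text.
    df = pd.read_csv(table_path, sep="\t", dtype=str)

    report = ProfileReport(
        df,
        title="Vertebrate Zoology Division - Mammalogy, Yale Peabody Museum",
    )
    report.to_file(out_html)
    return report
```

Calling `build_report("occurrence.txt")` would then write `report.html` next to the script; the exact numbers may differ from those shown here if the library version or configuration differs.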
Variables
gbifID
Text
Unique 
| Distinct | 18866 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 18866 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 4953409301 |
|---|---|
| 2nd row | 4911830319 |
| 3rd row | 4911830318 |
| 4th row | 4911830317 |
| 5th row | 4911830316 |
| Value | Count | Frequency (%) |
| 4953409301 | 1 | < 0.1% |
| 4599382340 | 1 | < 0.1% |
| 4911830315 | 1 | < 0.1% |
| 4911830314 | 1 | < 0.1% |
| 4911830313 | 1 | < 0.1% |
| 4911830312 | 1 | < 0.1% |
| 4911830311 | 1 | < 0.1% |
| 4911830310 | 1 | < 0.1% |
| 4911830309 | 1 | < 0.1% |
| 4911830308 | 1 | < 0.1% |
| Other values (18856) | 18856 | 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 30292 | 16.1% |
| 3 | 27042 | 14.3% |
| 5 | 25137 | 13.3% |
| 9 | 22536 | 11.9% |
| 0 | 22490 | 11.9% |
| 2 | 21472 | 11.4% |
| 4 | 11335 | 6.0% |
| 7 | 10804 | 5.7% |
| 8 | 8933 | 4.7% |
| 6 | 8619 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 188660 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 30292 | 16.1% |
| 3 | 27042 | 14.3% |
| 5 | 25137 | 13.3% |
| 9 | 22536 | 11.9% |
| 0 | 22490 | 11.9% |
| 2 | 21472 | 11.4% |
| 4 | 11335 | 6.0% |
| 7 | 10804 | 5.7% |
| 8 | 8933 | 4.7% |
| 6 | 8619 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 188660 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 30292 | 16.1% |
| 3 | 27042 | 14.3% |
| 5 | 25137 | 13.3% |
| 9 | 22536 | 11.9% |
| 0 | 22490 | 11.9% |
| 2 | 21472 | 11.4% |
| 4 | 11335 | 6.0% |
| 7 | 10804 | 5.7% |
| 8 | 8933 | 4.7% |
| 6 | 8619 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 188660 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 30292 | 16.1% |
| 3 | 27042 | 14.3% |
| 5 | 25137 | 13.3% |
| 9 | 22536 | 11.9% |
| 0 | 22490 | 11.9% |
| 2 | 21472 | 11.4% |
| 4 | 11335 | 6.0% |
| 7 | 10804 | 5.7% |
| 8 | 8933 | 4.7% |
| 6 | 8619 | 4.6% |
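The character and category tables above can be reproduced in spirit with the standard library. The sketch below counts characters and their Unicode general categories for a list of strings; `char_profile` is a hypothetical helper, not the profiler's own code. Note that `unicodedata.category` returns short codes such as `Nd`, which corresponds to the "Decimal Number" label used in the report.

```python
import unicodedata
from collections import Counter


def char_profile(values):
    """Count characters and Unicode general categories across strings."""
    chars = Counter()
    categories = Counter()
    for value in values:
        for ch in value:
            chars[ch] += 1
            # e.g. "Nd" = decimal number, "Ll" = lowercase letter
            categories[unicodedata.category(ch)] += 1
    return chars, categories


# First two sample rows of gbifID from the table above.
chars, categories = char_profile(["4953409301", "4911830319"])
print(chars.most_common(3))
print(dict(categories))
```

For an identifier column made only of digits, every character falls into the single `Nd` category, which matches the one-row category table above.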
accessRights
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 129 |
|---|---|
| Median length | 129 |
| Mean length | 129 |
| Min length | 129 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
|---|---|
| 2nd row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
| 3rd row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
| 4th row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
| 5th row | Open Access, http://creativecommons.org/publicdomain/zero/1.0/; see Yale Peabody policies at: http://hdl.handle.net/10079/8931zqj |
| Value | Count | Frequency (%) |
| open | 18866 | 11.1% |
| access | 18866 | 11.1% |
| http://creativecommons.org/publicdomain/zero/1.0 | 18866 | 11.1% |
| see | 18866 | 11.1% |
| yale | 18866 | 11.1% |
| peabody | 18866 | 11.1% |
| policies | 18866 | 11.1% |
| at | 18866 | 11.1% |
| http://hdl.handle.net/10079/8931zqj | 18866 | 11.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 226392 | 9.3% |
| / | 188660 | 7.8% |
| (space) | 150928 | 6.2% |
| t | 132062 | 5.4% |
| o | 132062 | 5.4% |
| a | 113196 | 4.7% |
| c | 113196 | 4.7% |
| i | 94330 | 3.9% |
| n | 94330 | 3.9% |
| s | 94330 | 3.9% |
| Other values (28) | 1094228 | 45.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1641342 | 67.4% |
| Other Punctuation | 358454 | 14.7% |
| Decimal Number | 207526 | 8.5% |
| Space Separator | 150928 | 6.2% |
| Uppercase Letter | 75464 | 3.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 226392 | 13.8% |
| t | 132062 | 8.0% |
| o | 132062 | 8.0% |
| a | 113196 | 6.9% |
| c | 113196 | 6.9% |
| i | 94330 | 5.7% |
| n | 94330 | 5.7% |
| s | 94330 | 5.7% |
| l | 94330 | 5.7% |
| p | 94330 | 5.7% |
| Other values (12) | 452784 | 27.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 56598 | 27.3% |
| 0 | 56598 | 27.3% |
| 9 | 37732 | 18.2% |
| 8 | 18866 | 9.1% |
| 7 | 18866 | 9.1% |
| 3 | 18866 | 9.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 188660 | 52.6% |
| . | 75464 | 21.1% |
| : | 56598 | 15.8% |
| ; | 18866 | 5.3% |
| , | 18866 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 18866 | 25.0% |
| O | 18866 | 25.0% |
| Y | 18866 | 25.0% |
| A | 18866 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 150928 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1716806 | 70.5% |
| Common | 716908 | 29.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 226392 | 13.2% |
| t | 132062 | 7.7% |
| o | 132062 | 7.7% |
| a | 113196 | 6.6% |
| c | 113196 | 6.6% |
| i | 94330 | 5.5% |
| n | 94330 | 5.5% |
| s | 94330 | 5.5% |
| l | 94330 | 5.5% |
| p | 94330 | 5.5% |
| Other values (16) | 528248 | 30.8% |
Common
| Value | Count | Frequency (%) |
| / | 188660 | 26.3% |
| (space) | 150928 | 21.1% |
| . | 75464 | 10.5% |
| : | 56598 | 7.9% |
| 1 | 56598 | 7.9% |
| 0 | 56598 | 7.9% |
| 9 | 37732 | 5.3% |
| 8 | 18866 | 2.6% |
| 7 | 18866 | 2.6% |
| 3 | 18866 | 2.6% |
| Other values (2) | 37732 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2433714 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 226392 | 9.3% |
| / | 188660 | 7.8% |
| (space) | 150928 | 6.2% |
| t | 132062 | 5.4% |
| o | 132062 | 5.4% |
| a | 113196 | 4.7% |
| c | 113196 | 4.7% |
| i | 94330 | 3.9% |
| n | 94330 | 3.9% |
| s | 94330 | 3.9% |
| Other values (28) | 1094228 | 45.0% |
bibliographicCitation
Text
Unique
| Distinct | 18866 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 62 |
|---|---|
| Median length | 50 |
| Mean length | 40.04675077 |
| Min length | 20 |
Unique
| Unique | 18866 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Tamias striatus fisheri (YPM MAM 017903) |
|---|---|
| 2nd row | Peromyscus leucopus noveboracensis (YPM MAM 017889) |
| 3rd row | Peromyscus leucopus noveboracensis (YPM MAM 017897) |
| 4th row | Peromyscus leucopus noveboracensis (YPM MAM 017895) |
| 5th row | Peromyscus leucopus noveboracensis (YPM MAM 017888) |
| Value | Count | Frequency (%) |
| ypm | 18866 | 18.4% |
| mam | 18866 | 18.4% |
| peromyscus | 1837 | 1.8% |
| cinereus | 1489 | 1.5% |
| sorex | 1193 | 1.2% |
| brevicauda | 1125 | 1.1% |
| blarina | 976 | 1.0% |
| zibethicus | 898 | 0.9% |
| talpoides | 868 | 0.8% |
| gapperi | 848 | 0.8% |
| Other values (20938) | 55590 | 54.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 83690 | 11.1% |
| M | 58523 | 7.7% |
| 0 | 44332 | 5.9% |
| s | 41623 | 5.5% |
| i | 36625 | 4.8% |
| a | 35093 | 4.6% |
| u | 30890 | 4.1% |
| e | 30381 | 4.0% |
| r | 26522 | 3.5% |
| o | 25267 | 3.3% |
| Other values (56) | 342576 | 45.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 370969 | 49.1% |
| Uppercase Letter | 131912 | 17.5% |
| Decimal Number | 126705 | 16.8% |
| Space Separator | 83690 | 11.1% |
| Close Punctuation | 18866 | 2.5% |
| Open Punctuation | 18866 | 2.5% |
| Other Punctuation | 4512 | 0.6% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 41623 | 11.2% |
| i | 36625 | 9.9% |
| a | 35093 | 9.5% |
| u | 30890 | 8.3% |
| e | 30381 | 8.2% |
| r | 26522 | 7.1% |
| o | 25267 | 6.8% |
| n | 22452 | 6.1% |
| c | 20781 | 5.6% |
| l | 16432 | 4.4% |
| Other values (16) | 84903 | 22.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 58523 | 44.4% |
| P | 21973 | 16.7% |
| A | 19464 | 14.8% |
| Y | 18866 | 14.3% |
| C | 2505 | 1.9% |
| S | 1952 | 1.5% |
| B | 1452 | 1.1% |
| O | 1312 | 1.0% |
| T | 1217 | 0.9% |
| N | 831 | 0.6% |
| Other values (14) | 3817 | 2.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 44332 | 35.0% |
| 1 | 20061 | 15.8% |
| 2 | 9843 | 7.8% |
| 6 | 8199 | 6.5% |
| 7 | 8066 | 6.4% |
| 5 | 8017 | 6.3% |
| 4 | 7780 | 6.1% |
| 3 | 7635 | 6.0% |
| 9 | 6550 | 5.2% |
| 8 | 6222 | 4.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4510 | > 99.9% |
| ? | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 83690 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 18866 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 18866 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 502881 | 66.6% |
| Common | 252641 | 33.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 58523 | 11.6% |
| s | 41623 | 8.3% |
| i | 36625 | 7.3% |
| a | 35093 | 7.0% |
| u | 30890 | 6.1% |
| e | 30381 | 6.0% |
| r | 26522 | 5.3% |
| o | 25267 | 5.0% |
| n | 22452 | 4.5% |
| P | 21973 | 4.4% |
| Other values (40) | 173532 | 34.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 83690 | 33.1% |
| 0 | 44332 | 17.5% |
| 1 | 20061 | 7.9% |
| ) | 18866 | 7.5% |
| ( | 18866 | 7.5% |
| 2 | 9843 | 3.9% |
| 6 | 8199 | 3.2% |
| 7 | 8066 | 3.2% |
| 5 | 8017 | 3.2% |
| 4 | 7780 | 3.1% |
| Other values (6) | 24921 | 9.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 755522 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 83690 | 11.1% |
| M | 58523 | 7.7% |
| 0 | 44332 | 5.9% |
| s | 41623 | 5.5% |
| i | 36625 | 4.8% |
| a | 35093 | 4.6% |
| u | 30890 | 4.1% |
| e | 30381 | 4.0% |
| r | 26522 | 3.5% |
| o | 25267 | 3.3% |
| Other values (56) | 342576 | 45.3% |
language
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 49 |
|---|---|
| Median length | 49 |
| Mean length | 49 |
| Min length | 49 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | http://creativecommons.org/publicdomain/zero/1.0/ |
|---|---|
| 2nd row | http://creativecommons.org/publicdomain/zero/1.0/ |
| 3rd row | http://creativecommons.org/publicdomain/zero/1.0/ |
| 4th row | http://creativecommons.org/publicdomain/zero/1.0/ |
| 5th row | http://creativecommons.org/publicdomain/zero/1.0/ |
| Value | Count | Frequency (%) |
| http://creativecommons.org/publicdomain/zero/1.0 | 18866 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 113196 | 12.2% |
| o | 94330 | 10.2% |
| m | 56598 | 6.1% |
| c | 56598 | 6.1% |
| r | 56598 | 6.1% |
| e | 56598 | 6.1% |
| t | 56598 | 6.1% |
| i | 56598 | 6.1% |
| . | 37732 | 4.1% |
| n | 37732 | 4.1% |
| Other values (14) | 301856 | 32.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 716908 | 77.6% |
| Other Punctuation | 169794 | 18.4% |
| Decimal Number | 37732 | 4.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 94330 | 13.2% |
| m | 56598 | 7.9% |
| c | 56598 | 7.9% |
| r | 56598 | 7.9% |
| e | 56598 | 7.9% |
| t | 56598 | 7.9% |
| i | 56598 | 7.9% |
| n | 37732 | 5.3% |
| a | 37732 | 5.3% |
| p | 37732 | 5.3% |
| Other values (9) | 169794 | 23.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 113196 | 66.7% |
| . | 37732 | 22.2% |
| : | 18866 | 11.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 18866 | 50.0% |
| 0 | 18866 | 50.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 716908 | 77.6% |
| Common | 207526 | 22.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 94330 | 13.2% |
| m | 56598 | 7.9% |
| c | 56598 | 7.9% |
| r | 56598 | 7.9% |
| e | 56598 | 7.9% |
| t | 56598 | 7.9% |
| i | 56598 | 7.9% |
| n | 37732 | 5.3% |
| a | 37732 | 5.3% |
| p | 37732 | 5.3% |
| Other values (9) | 169794 | 23.7% |
Common
| Value | Count | Frequency (%) |
| / | 113196 | 54.5% |
| . | 37732 | 18.2% |
| 1 | 18866 | 9.1% |
| : | 18866 | 9.1% |
| 0 | 18866 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 924434 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 113196 | 12.2% |
| o | 94330 | 10.2% |
| m | 56598 | 6.1% |
| c | 56598 | 6.1% |
| r | 56598 | 6.1% |
| e | 56598 | 6.1% |
| t | 56598 | 6.1% |
| i | 56598 | 6.1% |
| . | 37732 | 4.1% |
| n | 37732 | 4.1% |
| Other values (14) | 301856 | 32.7% |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 18866 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 37732 | 28.6% |
| 0 | 37732 | 28.6% |
| _ | 37732 | 28.6% |
| 1 | 18866 | 14.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 56598 | 42.9% |
| Uppercase Letter | 37732 | 28.6% |
| Connector Punctuation | 37732 | 28.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 37732 | 66.7% |
| 1 | 18866 | 33.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 37732 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 37732 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 94330 | 71.4% |
| Latin | 37732 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 37732 | 40.0% |
| _ | 37732 | 40.0% |
| 1 | 18866 | 20.0% |
Latin
| Value | Count | Frequency (%) |
| C | 37732 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 132062 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 37732 | 28.6% |
| 0 | 37732 | 28.6% |
| _ | 37732 | 28.6% |
| 1 | 18866 | 14.3% |
modified
Text
| Distinct | 1200 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 667 |
|---|---|
| Unique (%) | 3.5% |
Sample
| 1st row | 2024-10-14T12:59:55Z |
|---|---|
| 2nd row | 2024-10-11T19:54:42Z |
| 3rd row | 2024-10-11T19:54:42Z |
| 4th row | 2024-10-11T19:54:42Z |
| 5th row | 2024-10-11T19:54:42Z |
| Value | Count | Frequency (%) |
| 2024-09-17t21:33:28z | 3971 | 21.0% |
| 2024-10-12t17:36:53z | 3555 | 18.8% |
| 2024-09-29t10:06:24z | 1799 | 9.5% |
| 2024-09-23t19:57:36z | 1572 | 8.3% |
| 2024-02-19t13:33:41z | 826 | 4.4% |
| 2024-04-16t21:52:31z | 553 | 2.9% |
| 2024-04-28t21:51:52z | 236 | 1.3% |
| 2024-10-22t21:33:57z | 219 | 1.2% |
| 2023-07-18t22:00:07z | 158 | 0.8% |
| 2020-12-23t21:50:47z | 157 | 0.8% |
| Other values (1190) | 5820 | 30.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 68042 | 18.0% |
| 0 | 47839 | 12.7% |
| 1 | 42959 | 11.4% |
| - | 37732 | 10.0% |
| : | 37732 | 10.0% |
| 3 | 29809 | 7.9% |
| 4 | 22029 | 5.8% |
| T | 18866 | 5.0% |
| Z | 18866 | 5.0% |
| 9 | 13778 | 3.7% |
| Other values (4) | 39668 | 10.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 264124 | 70.0% |
| Dash Punctuation | 37732 | 10.0% |
| Other Punctuation | 37732 | 10.0% |
| Uppercase Letter | 37732 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 68042 | 25.8% |
| 0 | 47839 | 18.1% |
| 1 | 42959 | 16.3% |
| 3 | 29809 | 11.3% |
| 4 | 22029 | 8.3% |
| 9 | 13778 | 5.2% |
| 6 | 11557 | 4.4% |
| 5 | 11299 | 4.3% |
| 7 | 11126 | 4.2% |
| 8 | 5686 | 2.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 18866 | 50.0% |
| Z | 18866 | 50.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37732 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 37732 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 339588 | 90.0% |
| Latin | 37732 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 68042 | 20.0% |
| 0 | 47839 | 14.1% |
| 1 | 42959 | 12.7% |
| - | 37732 | 11.1% |
| : | 37732 | 11.1% |
| 3 | 29809 | 8.8% |
| 4 | 22029 | 6.5% |
| 9 | 13778 | 4.1% |
| 6 | 11557 | 3.4% |
| 5 | 11299 | 3.3% |
| Other values (2) | 16812 | 5.0% |
Latin
| Value | Count | Frequency (%) |
| T | 18866 | 50.0% |
| Z | 18866 | 50.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 377320 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 68042 | 18.0% |
| 0 | 47839 | 12.7% |
| 1 | 42959 | 11.4% |
| - | 37732 | 10.0% |
| : | 37732 | 10.0% |
| 3 | 29809 | 7.9% |
| 4 | 22029 | 5.8% |
| T | 18866 | 5.0% |
| Z | 18866 | 5.0% |
| 9 | 13778 | 3.7% |
| Other values (4) | 39668 | 10.5% |
references
Text
Unique 
| Distinct | 18866 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 68 |
|---|---|
| Median length | 64 |
| Mean length | 64.95473338 |
| Min length | 64 |
Unique
| Unique | 18866 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://collections.peabody.yale.edu/search/Record/YPM-MAM-017903 |
|---|---|
| 2nd row | http://collections.peabody.yale.edu/search/Record/YPM-MAM-017889 |
| 3rd row | http://collections.peabody.yale.edu/search/Record/YPM-MAM-017897 |
| 4th row | http://collections.peabody.yale.edu/search/Record/YPM-MAM-017895 |
| 5th row | http://collections.peabody.yale.edu/search/Record/YPM-MAM-017888 |
| Value | Count | Frequency (%) |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017903 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017835 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017891 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017900 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017899 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017902 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017890 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017901 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017896 | 1 | < 0.1% |
| http://collections.peabody.yale.edu/search/record/ypm-mam-017898 | 1 | < 0.1% |
| Other values (18856) | 18856 | 99.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 113196 | 9.2% |
| / | 94330 | 7.7% |
| c | 75464 | 6.2% |
| o | 75464 | 6.2% |
| . | 61101 | 5.0% |
| M | 56598 | 4.6% |
| t | 56598 | 4.6% |
| l | 56598 | 4.6% |
| d | 56598 | 4.6% |
| a | 56598 | 4.6% |
| Other values (25) | 522891 | 42.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 754640 | 61.6% |
| Other Punctuation | 174297 | 14.2% |
| Uppercase Letter | 132062 | 10.8% |
| Decimal Number | 126705 | 10.3% |
| Dash Punctuation | 37732 | 3.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 113196 | 15.0% |
| c | 75464 | 10.0% |
| o | 75464 | 10.0% |
| t | 56598 | 7.5% |
| l | 56598 | 7.5% |
| d | 56598 | 7.5% |
| a | 56598 | 7.5% |
| r | 37732 | 5.0% |
| y | 37732 | 5.0% |
| h | 37732 | 5.0% |
| Other values (6) | 150928 | 20.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 44332 | 35.0% |
| 1 | 20061 | 15.8% |
| 2 | 9843 | 7.8% |
| 6 | 8199 | 6.5% |
| 7 | 8066 | 6.4% |
| 5 | 8017 | 6.3% |
| 4 | 7780 | 6.1% |
| 3 | 7635 | 6.0% |
| 9 | 6550 | 5.2% |
| 8 | 6222 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 56598 | 42.9% |
| P | 18866 | 14.3% |
| A | 18866 | 14.3% |
| Y | 18866 | 14.3% |
| R | 18866 | 14.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 94330 | 54.1% |
| . | 61101 | 35.1% |
| : | 18866 | 10.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37732 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 886702 | 72.4% |
| Common | 338734 | 27.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 113196 | 12.8% |
| c | 75464 | 8.5% |
| o | 75464 | 8.5% |
| M | 56598 | 6.4% |
| t | 56598 | 6.4% |
| l | 56598 | 6.4% |
| d | 56598 | 6.4% |
| a | 56598 | 6.4% |
| r | 37732 | 4.3% |
| y | 37732 | 4.3% |
| Other values (11) | 264124 | 29.8% |
Common
| Value | Count | Frequency (%) |
| / | 94330 | 27.8% |
| . | 61101 | 18.0% |
| 0 | 44332 | 13.1% |
| - | 37732 | 11.1% |
| 1 | 20061 | 5.9% |
| : | 18866 | 5.6% |
| 2 | 9843 | 2.9% |
| 6 | 8199 | 2.4% |
| 7 | 8066 | 2.4% |
| 5 | 8017 | 2.4% |
| Other values (4) | 28187 | 8.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1225436 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 113196 | 9.2% |
| / | 94330 | 7.7% |
| c | 75464 | 6.2% |
| o | 75464 | 6.2% |
| . | 61101 | 5.0% |
| M | 56598 | 4.6% |
| t | 56598 | 4.6% |
| l | 56598 | 4.6% |
| d | 56598 | 4.6% |
| a | 56598 | 4.6% |
| Other values (25) | 522891 | 42.7% |
rightsHolder
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yale Peabody Museum |
|---|---|
| 2nd row | Yale Peabody Museum |
| 3rd row | Yale Peabody Museum |
| 4th row | Yale Peabody Museum |
| 5th row | Yale Peabody Museum |
| Value | Count | Frequency (%) |
| yale | 18866 | 33.3% |
| peabody | 18866 | 33.3% |
| museum | 18866 | 33.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 56598 | 15.8% |
| a | 37732 | 10.5% |
| (space) | 37732 | 10.5% |
| u | 37732 | 10.5% |
| Y | 18866 | 5.3% |
| l | 18866 | 5.3% |
| P | 18866 | 5.3% |
| b | 18866 | 5.3% |
| o | 18866 | 5.3% |
| d | 18866 | 5.3% |
| Other values (4) | 75464 | 21.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 264124 | 73.7% |
| Uppercase Letter | 56598 | 15.8% |
| Space Separator | 37732 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 56598 | 21.4% |
| a | 37732 | 14.3% |
| u | 37732 | 14.3% |
| l | 18866 | 7.1% |
| b | 18866 | 7.1% |
| o | 18866 | 7.1% |
| d | 18866 | 7.1% |
| y | 18866 | 7.1% |
| s | 18866 | 7.1% |
| m | 18866 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 18866 | 33.3% |
| P | 18866 | 33.3% |
| M | 18866 | 33.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 37732 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 320722 | 89.5% |
| Common | 37732 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 56598 | 17.6% |
| a | 37732 | 11.8% |
| u | 37732 | 11.8% |
| Y | 18866 | 5.9% |
| l | 18866 | 5.9% |
| P | 18866 | 5.9% |
| b | 18866 | 5.9% |
| o | 18866 | 5.9% |
| d | 18866 | 5.9% |
| y | 18866 | 5.9% |
| Other values (3) | 56598 | 17.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 37732 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 358454 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 56598 | 15.8% |
| a | 37732 | 10.5% |
| (space) | 37732 | 10.5% |
| u | 37732 | 10.5% |
| Y | 18866 | 5.3% |
| l | 18866 | 5.3% |
| P | 18866 | 5.3% |
| b | 18866 | 5.3% |
| o | 18866 | 5.3% |
| d | 18866 | 5.3% |
| Other values (4) | 75464 | 21.1% |
type
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PhysicalObject |
|---|---|
| 2nd row | PhysicalObject |
| 3rd row | PhysicalObject |
| 4th row | PhysicalObject |
| 5th row | PhysicalObject |
| Value | Count | Frequency (%) |
| physicalobject | 18866 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 37732 | 14.3% |
| P | 18866 | 7.1% |
| h | 18866 | 7.1% |
| y | 18866 | 7.1% |
| s | 18866 | 7.1% |
| i | 18866 | 7.1% |
| a | 18866 | 7.1% |
| l | 18866 | 7.1% |
| O | 18866 | 7.1% |
| b | 18866 | 7.1% |
| Other values (3) | 56598 | 21.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 226392 | 85.7% |
| Uppercase Letter | 37732 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 37732 | 16.7% |
| h | 18866 | 8.3% |
| y | 18866 | 8.3% |
| s | 18866 | 8.3% |
| i | 18866 | 8.3% |
| a | 18866 | 8.3% |
| l | 18866 | 8.3% |
| b | 18866 | 8.3% |
| j | 18866 | 8.3% |
| e | 18866 | 8.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 18866 | 50.0% |
| O | 18866 | 50.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 264124 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 37732 | 14.3% |
| P | 18866 | 7.1% |
| h | 18866 | 7.1% |
| y | 18866 | 7.1% |
| s | 18866 | 7.1% |
| i | 18866 | 7.1% |
| a | 18866 | 7.1% |
| l | 18866 | 7.1% |
| O | 18866 | 7.1% |
| b | 18866 | 7.1% |
| Other values (3) | 56598 | 21.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 264124 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 37732 | 14.3% |
| P | 18866 | 7.1% |
| h | 18866 | 7.1% |
| y | 18866 | 7.1% |
| s | 18866 | 7.1% |
| i | 18866 | 7.1% |
| a | 18866 | 7.1% |
| l | 18866 | 7.1% |
| O | 18866 | 7.1% |
| b | 18866 | 7.1% |
| Other values (3) | 56598 | 21.4% |
datasetID
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 0 | 18321 | 97.1% |
| 1 | 545 | 2.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 18321 | 97.1% |
| 1 | 545 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18866 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 18321 | 97.1% |
| 1 | 545 | 2.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18866 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 18321 | |
| 1 | 545 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18866 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 18321 | |
| 1 | 545 | 2.9% |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | YPM |
|---|---|
| 2nd row | YPM |
| 3rd row | YPM |
| 4th row | YPM |
| 5th row | YPM |
| Value | Count | Frequency (%) |
| ypm | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 18866 | |
| P | 18866 | |
| M | 18866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 56598 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 18866 | |
| P | 18866 | |
| M | 18866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56598 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 18866 | |
| P | 18866 | |
| M | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56598 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 18866 | |
| P | 18866 | |
| M | 18866 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | VZ |
|---|---|
| 2nd row | VZ |
| 3rd row | VZ |
| 4th row | VZ |
| 5th row | VZ |
| Value | Count | Frequency (%) |
| vz | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| V | 18866 | |
| Z | 18866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 37732 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 18866 | |
| Z | 18866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37732 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| V | 18866 | |
| Z | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37732 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| V | 18866 | |
| Z | 18866 |
ownerInstitutionCode
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | YPM |
|---|---|
| 2nd row | YPM |
| 3rd row | YPM |
| 4th row | YPM |
| 5th row | YPM |
| Value | Count | Frequency (%) |
| ypm | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 18866 | |
| P | 18866 | |
| M | 18866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 56598 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 18866 | |
| P | 18866 | |
| M | 18866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56598 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 18866 | |
| P | 18866 | |
| M | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56598 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 18866 | |
| P | 18866 | |
| M | 18866 |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 94330 | |
| P | 37732 | 11.1% |
| R | 37732 | 11.1% |
| S | 37732 | 11.1% |
| V | 18866 | 5.6% |
| D | 18866 | 5.6% |
| _ | 18866 | 5.6% |
| C | 18866 | 5.6% |
| I | 18866 | 5.6% |
| M | 18866 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 320722 | |
| Connector Punctuation | 18866 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 94330 | |
| P | 37732 | 11.8% |
| R | 37732 | 11.8% |
| S | 37732 | 11.8% |
| V | 18866 | 5.9% |
| D | 18866 | 5.9% |
| C | 18866 | 5.9% |
| I | 18866 | 5.9% |
| M | 18866 | 5.9% |
| N | 18866 | 5.9% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 18866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 320722 | |
| Common | 18866 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 94330 | |
| P | 37732 | 11.8% |
| R | 37732 | 11.8% |
| S | 37732 | 11.8% |
| V | 18866 | 5.9% |
| D | 18866 | 5.9% |
| C | 18866 | 5.9% |
| I | 18866 | 5.9% |
| M | 18866 | 5.9% |
| N | 18866 | 5.9% |
Common
| Value | Count | Frequency (%) |
| _ | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 339588 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 94330 | |
| P | 37732 | 11.1% |
| R | 37732 | 11.1% |
| S | 37732 | 11.1% |
| V | 18866 | 5.6% |
| D | 18866 | 5.6% |
| _ | 18866 | 5.6% |
| C | 18866 | 5.6% |
| I | 18866 | 5.6% |
| M | 18866 | 5.6% |
dataGeneralizations
Text
Constant  Missing
| Distinct | 1 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 18800 |
| Missing (%) | 99.7% |
| Memory size | 147.5 KiB |
Length
| Max length | 27 |
|---|---|
| Median length | 27 |
| Mean length | 27 |
| Min length | 27 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Coordinate data unavailable |
|---|---|
| 2nd row | Coordinate data unavailable |
| 3rd row | Coordinate data unavailable |
| 4th row | Coordinate data unavailable |
| 5th row | Coordinate data unavailable |
| Value | Count | Frequency (%) |
| coordinate | 66 | |
| data | 66 | |
| unavailable | 66 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 396 | |
| o | 132 | 7.4% |
| d | 132 | 7.4% |
| i | 132 | 7.4% |
| n | 132 | 7.4% |
| t | 132 | 7.4% |
| e | 132 | 7.4% |
| (space) | 132 | 7.4% |
| l | 132 | 7.4% |
| C | 66 | 3.7% |
| Other values (4) | 264 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1584 | |
| Space Separator | 132 | 7.4% |
| Uppercase Letter | 66 | 3.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 396 | |
| o | 132 | 8.3% |
| d | 132 | 8.3% |
| i | 132 | 8.3% |
| n | 132 | 8.3% |
| t | 132 | 8.3% |
| e | 132 | 8.3% |
| l | 132 | 8.3% |
| r | 66 | 4.2% |
| u | 66 | 4.2% |
| Other values (2) | 132 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 132 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 66 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1650 | |
| Common | 132 | 7.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 396 | |
| o | 132 | 8.0% |
| d | 132 | 8.0% |
| i | 132 | 8.0% |
| n | 132 | 8.0% |
| t | 132 | 8.0% |
| e | 132 | 8.0% |
| l | 132 | 8.0% |
| C | 66 | 4.0% |
| r | 66 | 4.0% |
| Other values (3) | 198 |
Common
| Value | Count | Frequency (%) |
| (space) | 132 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1782 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 396 | |
| o | 132 | 7.4% |
| d | 132 | 7.4% |
| i | 132 | 7.4% |
| n | 132 | 7.4% |
| t | 132 | 7.4% |
| e | 132 | 7.4% |
| (space) | 132 | 7.4% |
| l | 132 | 7.4% |
| C | 66 | 3.7% |
| Other values (4) | 264 |
dynamicProperties
Text
Unique
| Distinct | 18866 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 1073 |
|---|---|
| Median length | 877 |
| Mean length | 64.79444503 |
| Min length | 19 |
Unique
| Unique | 18866 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | { "irn": "2495311" } |
|---|---|
| 2nd row | { "irn": "2489043", "media": "1223142:2398869c-63eb-410d-8cf8-205d5aacbfcd", "mm_repository_id": "1223142" } |
| 3rd row | { "irn": "2489051", "media": "1223150:ed40315a-fb57-4421-a251-a7ede5b38478", "mm_repository_id": "1223150" } |
| 4th row | { "irn": "2489049", "media": "1223148:3d1eee9f-f1e6-4948-b842-640fbf489e2a", "mm_repository_id": "1223148" } |
| 5th row | { "irn": "2489042", "media": "1223141:56aefa44-5e83-4aec-83f3-b632bc2756cf", "mm_repository_id": "1223141" } |
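The sample values above are JSON objects, so the keys tallied in the table that follows (`irn`, `media`, `mm_repository_id`, and so on) can be recovered by parsing each value. The following is a minimal sketch, assuming every value is a flat JSON object; the helper name `key_counts` is hypothetical, and values that fail to parse are simply skipped.

```python
import json
from collections import Counter

# Two sample values copied from the rows above.
samples = [
    '{ "irn": "2495311" }',
    '{ "irn": "2489043", "media": "1223142:2398869c-63eb-410d-8cf8-205d5aacbfcd", '
    '"mm_repository_id": "1223142" }',
]

def key_counts(values):
    """Count how often each top-level JSON key appears across the values."""
    counts = Counter()
    for raw in values:
        try:
            record = json.loads(raw)
        except json.JSONDecodeError:
            # Assumption: malformed values are ignored rather than repaired.
            continue
        if isinstance(record, dict):
            counts.update(record.keys())
    return counts

counts = key_counts(samples)
```

Counting keys this way is only a rough check against the word-frequency table, since the report tokenizes the raw text rather than parsing it.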
| Value | Count | Frequency (%) |
| 38111 | ||
| irn | 18866 | |
| solr_long_lat | 13323 | 10.5% |
| original_num | 6214 | 4.9% |
| osteo | 4381 | 3.4% |
| mm_repository_id | 455 | 0.4% |
| media | 455 | 0.4% |
| related_record_links | 379 | 0.3% |
| related_record_types | 379 | 0.3% |
| 71.273830,44.049466 | 311 | 0.2% |
| Other values (33627) | 44501 |
Most occurring characters
| Value | Count | Frequency (%) |
| " | 160284 | 13.1% |
| (space) | 108509 | 8.9% |
| 1 | 48769 | 4.0% |
| l | 47322 | 3.9% |
| n | 44996 | 3.7% |
| 0 | 44312 | 3.6% |
| 4 | 43624 | 3.6% |
| r | 41589 | 3.4% |
| : | 41225 | 3.4% |
| 3 | 41094 | 3.4% |
| Other values (56) | 600688 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 390133 | |
| Lowercase Letter | 325132 | |
| Other Punctuation | 272361 | |
| Space Separator | 108509 | 8.9% |
| Connector Punctuation | 35286 | 2.9% |
| Uppercase Letter | 26260 | 2.1% |
| Open Punctuation | 23249 | 1.9% |
| Close Punctuation | 23247 | 1.9% |
| Dash Punctuation | 18214 | 1.5% |
| Math Symbol | 19 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 10867 | |
| O | 8766 | |
| A | 5225 | |
| P | 820 | 3.1% |
| Y | 404 | 1.5% |
| R | 105 | 0.4% |
| H | 10 | < 0.1% |
| S | 8 | < 0.1% |
| C | 8 | < 0.1% |
| E | 7 | < 0.1% |
| Other values (11) | 40 | 0.2% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 47322 | |
| n | 44996 | |
| r | 41589 | |
| o | 38912 | |
| i | 33038 | |
| a | 23318 | |
| g | 19538 | |
| t | 19299 | |
| s | 18921 | 5.8% |
| e | 10092 | 3.1% |
| Other values (10) | 28107 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 48769 | |
| 0 | 44312 | |
| 4 | 43624 | |
| 3 | 41094 | |
| 5 | 40074 | |
| 7 | 40008 | |
| 6 | 38040 | |
| 2 | 36853 | |
| 9 | 29256 | |
| 8 | 28103 |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 160284 | |
| : | 41225 | 15.1% |
| . | 36322 | 13.3% |
| , | 34528 | 12.7% |
| / | 1 | < 0.1% |
| ? | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| { | 18866 | |
| ( | 4383 | 18.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| } | 18866 | |
| ) | 4381 | 18.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 108509 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 35286 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18214 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 19 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 871020 | |
| Latin | 351392 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 47322 | |
| n | 44996 | |
| r | 41589 | |
| o | 38912 | |
| i | 33038 | |
| a | 23318 | |
| g | 19538 | 5.6% |
| t | 19299 | 5.5% |
| s | 18921 | 5.4% |
| M | 10867 | 3.1% |
| Other values (31) | 53592 |
Common
| Value | Count | Frequency (%) |
| " | 160284 | |
| (space) | 108509 | |
| 1 | 48769 | 5.6% |
| 0 | 44312 | 5.1% |
| 4 | 43624 | 5.0% |
| : | 41225 | 4.7% |
| 3 | 41094 | 4.7% |
| 5 | 40074 | 4.6% |
| 7 | 40008 | 4.6% |
| 6 | 38040 | 4.4% |
| Other values (15) | 265081 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1222412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| " | 160284 | 13.1% |
| (space) | 108509 | 8.9% |
| 1 | 48769 | 4.0% |
| l | 47322 | 3.9% |
| n | 44996 | 3.7% |
| 0 | 44312 | 3.6% |
| 4 | 43624 | 3.6% |
| r | 41589 | 3.4% |
| : | 41225 | 3.4% |
| 3 | 41094 | 3.4% |
| Other values (56) | 600688 |
occurrenceID
Text
Unique 
| Distinct | 18866 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 18866 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | urn:uuid:ef710e32-eb63-4875-b9d8-f21a261c1f52 |
|---|---|
| 2nd row | urn:uuid:2df9a10d-0595-4c2d-bb13-43b6677a15ce |
| 3rd row | urn:uuid:35474ea7-f956-4872-88c2-a8c56cbe9f90 |
| 4th row | urn:uuid:6eaa6b8b-f8a1-44ee-b671-1a734de9ada2 |
| 5th row | urn:uuid:b45e450f-3835-46af-be66-6494f44d014e |
| Value | Count | Frequency (%) |
| urn:uuid:ef710e32-eb63-4875-b9d8-f21a261c1f52 | 1 | < 0.1% |
| urn:uuid:7a7bd1dd-0c61-423e-8d79-316ae9466af3 | 1 | < 0.1% |
| urn:uuid:c2221631-94d5-4364-b7a1-6e8875d768ba | 1 | < 0.1% |
| urn:uuid:565e73ca-2d43-4f72-bf13-66ca168617ad | 1 | < 0.1% |
| urn:uuid:8ebc41fa-c154-4c27-a7d4-606e62b2dc95 | 1 | < 0.1% |
| urn:uuid:9ba9abd0-a03f-49c3-97e5-d8a6557c42bd | 1 | < 0.1% |
| urn:uuid:183dfe30-8155-4c5d-ae5d-15cc0b7ea3b8 | 1 | < 0.1% |
| urn:uuid:fa9cc82d-fccf-4fb9-834c-a5e890e5ff61 | 1 | < 0.1% |
| urn:uuid:b4b795a6-619a-4d62-b9a2-3c911f103ed3 | 1 | < 0.1% |
| urn:uuid:a57c6e6c-6f11-465a-a440-a5953d2cf9d2 | 1 | < 0.1% |
| Other values (18856) | 18856 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 75464 | 8.9% |
| u | 56598 | 6.7% |
| 4 | 54374 | 6.4% |
| d | 54159 | 6.4% |
| 8 | 40303 | 4.7% |
| 9 | 40140 | 4.7% |
| b | 40080 | 4.7% |
| a | 39808 | 4.7% |
| : | 37732 | 4.4% |
| f | 35654 | 4.2% |
| Other values (12) | 374658 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 382516 | |
| Lowercase Letter | 353258 | |
| Dash Punctuation | 75464 | 8.9% |
| Other Punctuation | 37732 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 56598 | |
| d | 54159 | |
| b | 40080 | |
| a | 39808 | |
| f | 35654 | |
| e | 35275 | |
| c | 35086 | |
| r | 18866 | 5.3% |
| i | 18866 | 5.3% |
| n | 18866 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 54374 | |
| 8 | 40303 | |
| 9 | 40140 | |
| 1 | 35621 | |
| 5 | 35502 | |
| 2 | 35421 | |
| 7 | 35401 | |
| 0 | 35374 | |
| 6 | 35239 | |
| 3 | 35141 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 75464 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 37732 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 495712 | |
| Latin | 353258 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 75464 | |
| 4 | 54374 | |
| 8 | 40303 | |
| 9 | 40140 | |
| : | 37732 | |
| 1 | 35621 | |
| 5 | 35502 | |
| 2 | 35421 | |
| 7 | 35401 | |
| 0 | 35374 | |
| Other values (2) | 70380 |
Latin
| Value | Count | Frequency (%) |
| u | 56598 | |
| d | 54159 | |
| b | 40080 | |
| a | 39808 | |
| f | 35654 | |
| e | 35275 | |
| c | 35086 | |
| r | 18866 | 5.3% |
| i | 18866 | 5.3% |
| n | 18866 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 848970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 75464 | 8.9% |
| u | 56598 | 6.7% |
| 4 | 54374 | 6.4% |
| d | 54159 | 6.4% |
| 8 | 40303 | 4.7% |
| 9 | 40140 | 4.7% |
| b | 40080 | 4.7% |
| a | 39808 | 4.7% |
| : | 37732 | 4.4% |
| f | 35654 | 4.2% |
| Other values (12) | 374658 |
catalogNumber
Text
Unique 
| Distinct | 18866 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 14 |
| Mean length | 14.95473338 |
| Min length | 14 |
Unique
| Unique | 18866 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | YPM MAM 017903 |
|---|---|
| 2nd row | YPM MAM 017889 |
| 3rd row | YPM MAM 017897 |
| 4th row | YPM MAM 017895 |
| 5th row | YPM MAM 017888 |
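The sample values follow a fixed `YPM MAM <number>` layout, and the word table for this variable also lists a dotted form (`015555.002`). A minimal sketch of splitting such values into parts is shown here; the pattern and the field names (`institution`, `collection`, `number`, `suffix`) are assumptions inferred from these samples, not a documented catalog-number format.

```python
import re

# Assumed layout: "YPM MAM <digits>" with an optional ".<digits>" suffix.
CATALOG_RE = re.compile(
    r"^(?P<institution>YPM) (?P<collection>MAM) (?P<number>\d+)(?:\.(?P<suffix>\d+))?$"
)

def parse_catalog_number(value):
    """Return the named parts of a catalog number, or None if it does not match."""
    match = CATALOG_RE.match(value.strip())
    if match is None:
        return None
    return match.groupdict()

parsed = parse_catalog_number("YPM MAM 017903")
with_suffix = parse_catalog_number("YPM MAM 015555.002")
```

Keeping the number as a string preserves the leading zeros seen in the samples.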
| Value | Count | Frequency (%) |
| ypm | 18866 | |
| mam | 18866 | |
| 015555.002 | 1 | < 0.1% |
| 017813 | 1 | < 0.1% |
| 017899 | 1 | < 0.1% |
| 017902 | 1 | < 0.1% |
| 017890 | 1 | < 0.1% |
| 017901 | 1 | < 0.1% |
| 017896 | 1 | < 0.1% |
| 017898 | 1 | < 0.1% |
| Other values (18858) | 18858 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 56598 | |
| 0 | 44332 | |
| (space) | 37732 | |
| 1 | 20061 | 7.1% |
| Y | 18866 | 6.7% |
| P | 18866 | 6.7% |
| A | 18866 | 6.7% |
| 2 | 9843 | 3.5% |
| 6 | 8199 | 2.9% |
| 7 | 8066 | 2.9% |
| Other values (6) | 40707 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 126705 | |
| Uppercase Letter | 113196 | |
| Space Separator | 37732 | 13.4% |
| Other Punctuation | 4503 | 1.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 44332 | |
| 1 | 20061 | |
| 2 | 9843 | 7.8% |
| 6 | 8199 | 6.5% |
| 7 | 8066 | 6.4% |
| 5 | 8017 | 6.3% |
| 4 | 7780 | 6.1% |
| 3 | 7635 | 6.0% |
| 9 | 6550 | 5.2% |
| 8 | 6222 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 56598 | |
| Y | 18866 | 16.7% |
| P | 18866 | 16.7% |
| A | 18866 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 37732 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4503 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 168940 | |
| Latin | 113196 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 44332 | |
| (space) | 37732 | |
| 1 | 20061 | |
| 2 | 9843 | 5.8% |
| 6 | 8199 | 4.9% |
| 7 | 8066 | 4.8% |
| 5 | 8017 | 4.7% |
| 4 | 7780 | 4.6% |
| 3 | 7635 | 4.5% |
| 9 | 6550 | 3.9% |
| Other values (2) | 10725 | 6.3% |
Latin
| Value | Count | Frequency (%) |
| M | 56598 | |
| Y | 18866 | 16.7% |
| P | 18866 | 16.7% |
| A | 18866 | 16.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 282136 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 56598 | |
| 0 | 44332 | |
| (space) | 37732 | |
| 1 | 20061 | 7.1% |
| Y | 18866 | 6.7% |
| P | 18866 | 6.7% |
| A | 18866 | 6.7% |
| 2 | 9843 | 3.5% |
| 6 | 8199 | 2.9% |
| 7 | 8066 | 2.9% |
| Other values (6) | 40707 |
recordedBy
Text
Missing 
| Distinct | 1050 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 4296 |
| Missing (%) | 22.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 120 |
|---|---|
| Median length | 80 |
| Mean length | 16.20549073 |
| Min length | 3 |
Unique
| Unique | 526 |
|---|---|
| Unique (%) | 3.6% |
Sample
| 1st row | Richard E. Boardman, Kristof Zyskowski |
|---|---|
| 2nd row | Richard E. Boardman |
| 3rd row | Lourdes M. Rojas |
| 4th row | Richard E. Boardman |
| 5th row | Richard E. Boardman |
| Value | Count | Frequency (%) |
| mariko | 1875 | 4.7% |
| yamasaki | 1875 | 4.7% |
| e | 1394 | 3.5% |
| b | 1115 | 2.8% |
| c | 1091 | 2.7% |
| j | 1070 | 2.7% |
| a | 867 | 2.2% |
| ryan | 849 | 2.1% |
| stephens | 848 | 2.1% |
| d | 830 | 2.1% |
| Other values (1289) | 28092 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 25336 | 10.7% |
| a | 21256 | 9.0% |
| e | 16815 | 7.1% |
| r | 13987 | 5.9% |
| i | 13506 | 5.7% |
| o | 12209 | 5.2% |
| n | 11603 | 4.9% |
| . | 10574 | 4.5% |
| l | 9856 | 4.2% |
| s | 8143 | 3.4% |
| Other values (59) | 92829 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 157721 | |
| Uppercase Letter | 40869 | 17.3% |
| Space Separator | 25336 | 10.7% |
| Other Punctuation | 11218 | 4.8% |
| Decimal Number | 552 | 0.2% |
| Dash Punctuation | 416 | 0.2% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 21256 | |
| e | 16815 | |
| r | 13987 | 8.9% |
| i | 13506 | 8.6% |
| o | 12209 | 7.7% |
| n | 11603 | 7.4% |
| l | 9856 | 6.2% |
| s | 8143 | 5.2% |
| t | 6641 | 4.2% |
| m | 6447 | 4.1% |
| Other values (17) | 37258 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4521 | 11.1% |
| R | 4515 | 11.0% |
| C | 3379 | 8.3% |
| S | 2982 | 7.3% |
| J | 2726 | 6.7% |
| E | 2666 | 6.5% |
| B | 2557 | 6.3% |
| D | 2059 | 5.0% |
| G | 1997 | 4.9% |
| Y | 1954 | 4.8% |
| Other values (15) | 11513 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 190 | |
| 7 | 70 | 12.7% |
| 8 | 69 | 12.5% |
| 9 | 68 | 12.3% |
| 6 | 64 | 11.6% |
| 2 | 43 | 7.8% |
| 0 | 41 | 7.4% |
| 3 | 7 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10574 | |
| , | 565 | 5.0% |
| & | 44 | 0.4% |
| ' | 32 | 0.3% |
| / | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 25336 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 416 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 198590 | |
| Common | 37524 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 21256 | 10.7% |
| e | 16815 | 8.5% |
| r | 13987 | 7.0% |
| i | 13506 | 6.8% |
| o | 12209 | 6.1% |
| n | 11603 | 5.8% |
| l | 9856 | 5.0% |
| s | 8143 | 4.1% |
| t | 6641 | 3.3% |
| m | 6447 | 3.2% |
| Other values (42) | 78127 |
Common
| Value | Count | Frequency (%) |
| (space) | 25336 | |
| . | 10574 | |
| , | 565 | 1.5% |
| - | 416 | 1.1% |
| 1 | 190 | 0.5% |
| 7 | 70 | 0.2% |
| 8 | 69 | 0.2% |
| 9 | 68 | 0.2% |
| 6 | 64 | 0.2% |
| & | 44 | 0.1% |
| Other values (7) | 128 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 236040 | |
| None | 74 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 25336 | 10.7% |
| a | 21256 | 9.0% |
| e | 16815 | 7.1% |
| r | 13987 | 5.9% |
| i | 13506 | 5.7% |
| o | 12209 | 5.2% |
| n | 11603 | 4.9% |
| . | 10574 | 4.5% |
| l | 9856 | 4.2% |
| s | 8143 | 3.4% |
| Other values (58) | 92755 |
None
| Value | Count | Frequency (%) |
| ü | 74 |
individualCount
Text
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.000318032 |
| Min length | 1 |
Unique
| Unique | 8 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 18844 | |
| 2 | 5 | < 0.1% |
| 3 | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| 11 | 2 | < 0.1% |
| 17 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| Other values (3) | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 18850 | |
| 2 | 6 | < 0.1% |
| 3 | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 0 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18872 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 18850 | |
| 2 | 6 | < 0.1% |
| 3 | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 0 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18872 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 18850 | |
| 2 | 6 | < 0.1% |
| 3 | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 0 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18872 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 18850 | |
| 2 | 6 | < 0.1% |
| 3 | 4 | < 0.1% |
| 6 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 0 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
sex
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10133 |
| Missing (%) | 53.7% |
| Memory size | 147.5 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.911714188 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FEMALE |
|---|---|
| 2nd row | FEMALE |
| 3rd row | MALE |
| 4th row | MALE |
| 5th row | FEMALE |
| Value | Count | Frequency (%) |
| male | 4752 | |
| female | 3981 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 12714 | |
| M | 8733 | |
| A | 8733 | |
| L | 8733 | |
| F | 3981 | 9.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 42894 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 12714 | |
| M | 8733 | |
| A | 8733 | |
| L | 8733 | |
| F | 3981 | 9.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42894 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 12714 | |
| M | 8733 | |
| A | 8733 | |
| L | 8733 | |
| F | 3981 | 9.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42894 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 12714 | |
| M | 8733 | |
| A | 8733 | |
| L | 8733 | |
| F | 3981 | 9.3% |
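The "Most occurring categories" tables in this report group characters by Unicode general category. A minimal sketch of that kind of tally, using Python's standard `unicodedata` module, is given here; whether the profiling tool computes its tables exactly this way is an assumption, and the `CATEGORY_NAMES` mapping and `category_counts` helper are hypothetical names covering only the categories that appear in this section.

```python
import unicodedata
from collections import Counter

# Readable labels for a few Unicode general-category codes (partial, illustrative).
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pc": "Connector Punctuation",
    "Pd": "Dash Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for unmapped categories.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# The five sample rows shown for this variable.
sample = ["FEMALE", "FEMALE", "MALE", "MALE", "FEMALE"]
counts = category_counts(sample)
```

Applied to the full column rather than five rows, the same tally would be expected to reproduce the single "Uppercase Letter" row reported above.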
lifeStage
Text
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 17963 |
| Missing (%) | 95.2% |
| Memory size | 147.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 6.280177187 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Adult |
| 4th row | Adult |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| adult | 508 | |
| juvenile | 322 | |
| immature | 29 | 3.2% |
| neonate | 25 | 2.8% |
| subadult | 17 | 1.9% |
| embryo | 2 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 893 | |
| l | 847 | |
| e | 723 | |
| t | 579 | |
| d | 525 | |
| A | 508 | |
| n | 347 | 6.1% |
| J | 322 | 5.7% |
| v | 322 | 5.7% |
| i | 322 | 5.7% |
| Other values (10) | 283 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4768 | |
| Uppercase Letter | 903 | 15.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 893 | |
| l | 847 | |
| e | 723 | |
| t | 579 | |
| d | 525 | |
| n | 347 | 7.3% |
| v | 322 | 6.8% |
| i | 322 | 6.8% |
| a | 71 | 1.5% |
| m | 60 | 1.3% |
| Other values (4) | 79 | 1.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 508 | |
| J | 322 | |
| I | 29 | 3.2% |
| N | 25 | 2.8% |
| S | 17 | 1.9% |
| E | 2 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5671 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 893 | |
| l | 847 | |
| e | 723 | |
| t | 579 | |
| d | 525 | |
| A | 508 | |
| n | 347 | 6.1% |
| J | 322 | 5.7% |
| v | 322 | 5.7% |
| i | 322 | 5.7% |
| Other values (10) | 283 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5671 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 893 | |
| l | 847 | |
| e | 723 | |
| t | 579 | |
| d | 525 | |
| A | 508 | |
| n | 347 | 6.1% |
| J | 322 | 5.7% |
| v | 322 | 5.7% |
| i | 322 | 5.7% |
| Other values (10) | 283 | 5.0% |
reproductiveCondition
Text
Missing
| Distinct | 626 |
|---|---|
| Distinct (%) | 27.3% |
| Missing | 16576 |
| Missing (%) | 87.9% |
| Memory size | 147.5 KiB |
Length
| Max length | 166 |
|---|---|
| Median length | 116 |
| Mean length | 12.40349345 |
| Min length | 2 |
Unique
| Unique | 457 |
|---|---|
| Unique (%) | 20.0% |
Sample
| 1st row | testes 5 x 2 mm |
|---|---|
| 2nd row | EMB; 6; 10x8 |
| 3rd row | SCR; L=6x4 |
| 4th row | SCR R=8x5 |
| 5th row | EMB; L=4; R=2, 14X18 |
| Value | Count | Frequency (%) |
| testes | 1006 | |
| mm | 877 | 14.1% |
| embryo | 650 | 10.4% |
| no | 643 | 10.3% |
| 3 | 151 | 2.4% |
| 2 | 137 | 2.2% |
| embryos | 137 | 2.2% |
| lactating | 137 | 2.2% |
| 4 | 135 | 2.2% |
| 5 | 111 | 1.8% |
| Other values (469) | 2242 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 3936 | |
| e | 3252 | |
| m | 2969 | 10.5% |
| s | 2466 | 8.7% |
| t | 2401 | 8.5% |
| o | 1669 | 5.9% |
| r | 1048 | 3.7% |
| n | 966 | 3.4% |
| b | 824 | 2.9% |
| y | 821 | 2.9% |
| Other values (60) | 8052 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18999 | |
| Space Separator | 3936 | 13.9% |
| Decimal Number | 2531 | 8.9% |
| Uppercase Letter | 1676 | 5.9% |
| Other Punctuation | 737 | 2.6% |
| Math Symbol | 248 | 0.9% |
| Dash Punctuation | 229 | 0.8% |
| Open Punctuation | 24 | 0.1% |
| Close Punctuation | 24 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3252 | |
| m | 2969 | |
| s | 2466 | |
| t | 2401 | |
| o | 1669 | |
| r | 1048 | 5.5% |
| n | 966 | 5.1% |
| b | 824 | 4.3% |
| y | 821 | 4.3% |
| a | 601 | 3.2% |
| Other values (15) | 1982 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 354 | |
| T | 280 | |
| L | 194 | |
| S | 157 | |
| C | 141 | 8.4% |
| P | 128 | 7.6% |
| N | 125 | 7.5% |
| A | 86 | 5.1% |
| E | 70 | 4.2% |
| B | 49 | 2.9% |
| Other values (8) | 92 | 5.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 466 | |
| 1 | 447 | |
| 2 | 372 | |
| 3 | 348 | |
| 4 | 251 | |
| 0 | 199 | |
| 6 | 162 | 6.4% |
| 7 | 104 | 4.1% |
| 8 | 104 | 4.1% |
| 9 | 78 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 348 | |
| , | 218 | |
| ; | 113 | 15.3% |
| : | 39 | 5.3% |
| " | 7 | 0.9% |
| & | 5 | 0.7% |
| ? | 3 | 0.4% |
| / | 3 | 0.4% |
| ' | 1 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 236 | |
| + | 10 | 4.0% |
| ~ | 1 | 0.4% |
| > | 1 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3936 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 229 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 24 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 24 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20675 | |
| Common | 7729 | 27.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3252 | |
| m | 2969 | |
| s | 2466 | |
| t | 2401 | |
| o | 1669 | |
| r | 1048 | 5.1% |
| n | 966 | 4.7% |
| b | 824 | 4.0% |
| y | 821 | 4.0% |
| a | 601 | 2.9% |
| Other values (33) | 3658 |
Common
| Value | Count | Frequency (%) |
| (space) | 3936 | |
| 5 | 466 | 6.0% |
| 1 | 447 | 5.8% |
| 2 | 372 | 4.8% |
| 3 | 348 | 4.5% |
| . | 348 | 4.5% |
| 4 | 251 | 3.2% |
| = | 236 | 3.1% |
| - | 229 | 3.0% |
| , | 218 | 2.8% |
| Other values (17) | 878 | 11.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28404 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 3936 | |
| e | 3252 | |
| m | 2969 | 10.5% |
| s | 2466 | 8.7% |
| t | 2401 | 8.5% |
| o | 1669 | 5.9% |
| r | 1048 | 3.7% |
| n | 966 | 3.4% |
| b | 824 | 2.9% |
| y | 821 | 2.9% |
| Other values (60) | 8052 |
behavior
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 18864 |
| Missing (%) | > 99.9% |
| Memory size | 147.5 KiB |
Length
| Max length | 64 |
|---|---|
| Median length | 56.5 |
| Mean length | 56.5 |
| Min length | 49 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | was calling while hanging from a 0.5 m tall shrub |
|---|---|
| 2nd row | was day-roosting in a dense subcanopy tree ca. 15 m above ground |
| Value | Count | Frequency (%) |
| was | 2 | 9.1% |
| a | 2 | 9.1% |
| m | 2 | 9.1% |
| in | 1 | 4.5% |
| above | 1 | 4.5% |
| 15 | 1 | 4.5% |
| ca | 1 | 4.5% |
| tree | 1 | 4.5% |
| subcanopy | 1 | 4.5% |
| dense | 1 | 4.5% |
| Other values (9) | 9 |
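The word table above can be reproduced from the two sample rows. The sketch below assumes the profiler lowercases each value, splits on whitespace, and strips leading and trailing punctuation; that tokenization is inferred from the table (for example `ca.` appearing as `ca`), not taken from the profiler's source, and `word_counts` is a hypothetical helper name.

```python
import string
from collections import Counter

# The two non-missing values of `behavior`, copied from the Sample table.
rows = [
    "was calling while hanging from a 0.5 m tall shrub",
    "was day-roosting in a dense subcanopy tree ca. 15 m above ground",
]

def word_counts(values):
    """Count lowercase whitespace-separated tokens, trimming edge punctuation."""
    counts = Counter()
    for value in values:
        for token in value.lower().split():
            token = token.strip(string.punctuation)  # "ca." -> "ca", "0.5" unchanged
            if token:
                counts[token] += 1
    return counts

counts = word_counts(rows)
total = sum(counts.values())
for word, n in counts.most_common(3):
    # Frequency (%) is the word count divided by the total number of tokens.
    print(f"{word}: {n} ({100 * n / total:.1f}%)")
```

Under these assumptions the total is 22 tokens across 19 distinct words, which agrees with the 10 listed rows plus "Other values (9)".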
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 20 | |
| a | 11 | 9.7% |
| n | 8 | 7.1% |
| o | 6 | 5.3% |
| s | 6 | 5.3% |
| e | 6 | 5.3% |
| l | 5 | 4.4% |
| i | 5 | 4.4% |
| g | 5 | 4.4% |
| r | 5 | 4.4% |
| Other values (17) | 36 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 86 | |
| Space Separator | 20 | 17.7% |
| Decimal Number | 4 | 3.5% |
| Other Punctuation | 2 | 1.8% |
| Dash Punctuation | 1 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11 | |
| n | 8 | 9.3% |
| o | 6 | 7.0% |
| s | 6 | 7.0% |
| e | 6 | 7.0% |
| l | 5 | 5.8% |
| i | 5 | 5.8% |
| g | 5 | 5.8% |
| r | 5 | 5.8% |
| t | 3 | 3.5% |
| Other values (11) | 26 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 0 | 1 | |
| 1 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 20 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 86 | |
| Common | 27 | 23.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11 | |
| n | 8 | 9.3% |
| o | 6 | 7.0% |
| s | 6 | 7.0% |
| e | 6 | 7.0% |
| l | 5 | 5.8% |
| i | 5 | 5.8% |
| g | 5 | 5.8% |
| r | 5 | 5.8% |
| t | 3 | 3.5% |
| Other values (11) | 26 |
Common
| Value | Count | Frequency (%) |
| (space) | 20 | |
| . | 2 | 7.4% |
| 5 | 2 | 7.4% |
| 0 | 1 | 3.7% |
| - | 1 | 3.7% |
| 1 | 1 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 113 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 20 | |
| a | 11 | 9.7% |
| n | 8 | 7.1% |
| o | 6 | 5.3% |
| s | 6 | 5.3% |
| e | 6 | 5.3% |
| l | 5 | 4.4% |
| i | 5 | 4.4% |
| g | 5 | 4.4% |
| r | 5 | 4.4% |
| Other values (17) | 36 |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 37732 | |
| P | 18866 | |
| R | 18866 | |
| S | 18866 | |
| N | 18866 | |
| T | 18866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 132062 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 37732 | |
| P | 18866 | |
| R | 18866 | |
| S | 18866 | |
| N | 18866 | |
| T | 18866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 132062 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 37732 | |
| P | 18866 | |
| R | 18866 | |
| S | 18866 | |
| N | 18866 | |
| T | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 132062 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 37732 | |
| P | 18866 | |
| R | 18866 | |
| S | 18866 | |
| N | 18866 | |
| T | 18866 |
preparations
Text
Missing 
| Distinct | 1019 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 349 |
| Missing (%) | 1.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 262 |
|---|---|
| Median length | 190 |
| Mean length | 25.19781822 |
| Min length | 4 |
Unique
| Unique | 762 |
|---|---|
| Unique (%) | 4.1% |
Sample
| 1st row | skin, round; skull; tissue (frozen) |
|---|---|
| 2nd row | tissue (frozen) |
| 3rd row | tissue (frozen) |
| 4th row | tissue (frozen) |
| 5th row | tissue (frozen) |
| Value | Count | Frequency (%) |
| skeleton | 13111 | |
| skull | 8315 | |
| only | 7454 | |
| skin | 6927 | |
| round | 5887 | |
| tissue | 4575 | 7.1% |
| frozen | 4435 | 6.9% |
| incomplete | 1443 | 2.2% |
| alc | 1212 | 1.9% |
| 10 | 1172 | 1.8% |
| Other values (1014) | 9705 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 45719 | 9.8% |
| n | 43643 | 9.4% |
| e | 42409 | 9.1% |
| l | 42371 | 9.1% |
| s | 40001 | 8.6% |
| o | 35814 | 7.7% |
| k | 28618 | 6.1% |
| t | 22389 | 4.8% |
| u | 19801 | 4.2% |
| i | 16215 | 3.5% |
| Other values (70) | 129608 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 352064 | |
| Space Separator | 45719 | 9.8% |
| Other Punctuation | 21809 | 4.7% |
| Close Punctuation | 15985 | 3.4% |
| Open Punctuation | 15984 | 3.4% |
| Decimal Number | 7890 | 1.7% |
| Uppercase Letter | 4747 | 1.0% |
| Dash Punctuation | 1217 | 0.3% |
| Math Symbol | 1173 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 43643 | |
| e | 42409 | |
| l | 42371 | |
| s | 40001 | |
| o | 35814 | |
| k | 28618 | |
| t | 22389 | |
| u | 19801 | |
| i | 16215 | 4.6% |
| r | 13228 | 3.8% |
| Other values (16) | 47575 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1139 | |
| S | 633 | |
| L | 485 | |
| O | 365 | 7.7% |
| M | 258 | 5.4% |
| R | 249 | 5.2% |
| I | 230 | 4.8% |
| T | 229 | 4.8% |
| E | 213 | 4.5% |
| D | 125 | 2.6% |
| Other values (16) | 821 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 7821 | |
| , | 7453 | |
| . | 3527 | |
| % | 2384 | 10.9% |
| / | 316 | 1.4% |
| " | 272 | 1.2% |
| & | 18 | 0.1% |
| ' | 10 | < 0.1% |
| ? | 6 | < 0.1% |
| : | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2841 | |
| 1 | 1930 | |
| 7 | 1409 | |
| 3 | 690 | 8.7% |
| 2 | 329 | 4.2% |
| 5 | 229 | 2.9% |
| 4 | 186 | 2.4% |
| 6 | 112 | 1.4% |
| 8 | 99 | 1.3% |
| 9 | 65 | 0.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 15984 | |
| ] | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 15983 | |
| [ | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 1172 | |
| + | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 45719 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1217 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 356811 | |
| Common | 109777 | 23.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 43643 | |
| e | 42409 | |
| l | 42371 | |
| s | 40001 | |
| o | 35814 | |
| k | 28618 | |
| t | 22389 | |
| u | 19801 | 5.5% |
| i | 16215 | 4.5% |
| r | 13228 | 3.7% |
| Other values (42) | 52322 |
Common
| Value | Count | Frequency (%) |
| (space) | 45719 | |
| ) | 15984 | 14.6% |
| ( | 15983 | 14.6% |
| ; | 7821 | 7.1% |
| , | 7453 | 6.8% |
| . | 3527 | 3.2% |
| 0 | 2841 | 2.6% |
| % | 2384 | 2.2% |
| 1 | 1930 | 1.8% |
| 7 | 1409 | 1.3% |
| Other values (18) | 4726 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 466588 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 45719 | 9.8% |
| n | 43643 | 9.4% |
| e | 42409 | 9.1% |
| l | 42371 | 9.1% |
| s | 40001 | 8.6% |
| o | 35814 | 7.7% |
| k | 28618 | 6.1% |
| t | 22389 | 4.8% |
| u | 19801 | 4.2% |
| i | 16215 | 3.5% |
| Other values (70) | 129608 |
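The sample rows and the punctuation counts (`;`, `(`, `)`) suggest that `preparations` values are semicolon-separated lists whose items may carry a parenthetical detail. A minimal sketch of splitting such a value follows; the format is inferred from the sample rows only, `split_preparations` is a hypothetical helper, and values with nested or unbalanced brackets are not handled.

```python
import re

def split_preparations(value):
    """Split a preparations string into (part, detail) pairs.

    Assumes items are separated by ';' and that an optional detail
    sits in trailing parentheses, as in 'tissue (frozen)'.
    """
    items = []
    for chunk in value.split(";"):
        chunk = chunk.strip()
        if not chunk:
            continue
        match = re.fullmatch(r"(.*?)\s*\((.*)\)", chunk)
        if match:
            items.append((match.group(1).strip(), match.group(2).strip()))
        else:
            items.append((chunk, None))  # no parenthetical detail
    return items

print(split_preparations("skin, round; skull; tissue (frozen)"))
```

Commas are left inside an item (`skin, round`) because the sample uses them as qualifiers of a part rather than as separators.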
disposition
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.98484045 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | in collection |
|---|---|
| 2nd row | in collection |
| 3rd row | in collection |
| 4th row | in collection |
| 5th row | in collection |
| Value | Count | Frequency (%) |
| in | 18804 | |
| collection | 18804 | |
| on | 62 | 0.2% |
| loan | 38 | 0.1% |
| not | 14 | < 0.1% |
| view | 14 | < 0.1% |
| exhibit | 10 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 37722 | |
| o | 37722 | |
| l | 37646 | |
| i | 37642 | |
| c | 37608 | |
| (space) | 18880 | |
| e | 18828 | |
| t | 18828 | |
| a | 38 | < 0.1% |
| v | 14 | < 0.1% |
| Other values (4) | 44 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 226092 | |
| Space Separator | 18880 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 37722 | |
| o | 37722 | |
| l | 37646 | |
| i | 37642 | |
| c | 37608 | |
| e | 18828 | |
| t | 18828 | |
| a | 38 | < 0.1% |
| v | 14 | < 0.1% |
| w | 14 | < 0.1% |
| Other values (3) | 30 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 18880 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 226092 | |
| Common | 18880 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 37722 | |
| o | 37722 | |
| l | 37646 | |
| i | 37642 | |
| c | 37608 | |
| e | 18828 | |
| t | 18828 | |
| a | 38 | < 0.1% |
| v | 14 | < 0.1% |
| w | 14 | < 0.1% |
| Other values (3) | 30 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 18880 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 244972 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 37722 | |
| o | 37722 | |
| l | 37646 | |
| i | 37642 | |
| c | 37608 | |
| (space) | 18880 | |
| e | 18828 | |
| t | 18828 | |
| a | 38 | < 0.1% |
| v | 14 | < 0.1% |
| Other values (4) | 44 | < 0.1% |
Missing 
| Distinct | 178 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 12450 |
| Missing (%) | 66.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 116 |
|---|---|
| Median length | 1 |
| Mean length | 8.085099751 |
| Min length | 1 |
Unique
| Unique | 75 |
|---|---|
| Unique (%) | 1.2% |
Sample
| 1st row | \| |
|---|---|
| 2nd row | \| |
| 3rd row | \| |
| 4th row | \| |
| 5th row | \| |
| Value | Count | Frequency (%) |
| 4933 | ||
| by | 1565 | 11.9% |
| det | 1461 | 11.1% |
| kristof | 303 | 2.3% |
| jordan | 300 | 2.3% |
| g | 300 | 2.3% |
| colosi | 300 | 2.3% |
| a | 296 | 2.3% |
| zyskowski | 291 | 2.2% |
| mary | 288 | 2.2% |
| Other values (171) | 3078 |
Most occurring characters
| Value | Count | Frequency (%) |
| \| | 7591 | 14.6% |
| (space) | 6699 | 12.9% |
| e | 2799 | 5.4% |
| . | 2603 | 5.0% |
| r | 2373 | 4.6% |
| y | 2262 | 4.4% |
| t | 2241 | 4.3% |
| o | 2076 | 4.0% |
| b | 1743 | 3.4% |
| D | 1738 | 3.4% |
| Other values (55) | 19749 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23551 | |
| Math Symbol | 7591 | 14.6% |
| Space Separator | 6699 | 12.9% |
| Uppercase Letter | 5999 | 11.6% |
| Other Punctuation | 4176 | 8.1% |
| Decimal Number | 3818 | 7.4% |
| Dash Punctuation | 40 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2799 | |
| r | 2373 | |
| y | 2262 | |
| t | 2241 | |
| o | 2076 | |
| b | 1743 | |
| s | 1614 | 6.9% |
| i | 1410 | 6.0% |
| a | 1357 | 5.8% |
| n | 1339 | 5.7% |
| Other values (15) | 4337 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 1738 | |
| A | 601 | 10.0% |
| K | 503 | 8.4% |
| C | 448 | 7.5% |
| J | 435 | 7.3% |
| M | 426 | 7.1% |
| G | 353 | 5.9% |
| T | 326 | 5.4% |
| Z | 303 | 5.1% |
| N | 129 | 2.2% |
| Other values (13) | 737 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1658 | |
| 2 | 1138 | |
| 8 | 277 | 7.3% |
| 9 | 275 | 7.2% |
| 1 | 251 | 6.6% |
| 7 | 132 | 3.5% |
| 6 | 33 | 0.9% |
| 4 | 25 | 0.7% |
| 3 | 21 | 0.6% |
| 5 | 8 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2603 | |
| : | 1569 | |
| ; | 3 | 0.1% |
| , | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 7591 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6699 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 40 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29550 | |
| Common | 22324 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2799 | 9.5% |
| r | 2373 | 8.0% |
| y | 2262 | 7.7% |
| t | 2241 | 7.6% |
| o | 2076 | 7.0% |
| b | 1743 | 5.9% |
| D | 1738 | 5.9% |
| s | 1614 | 5.5% |
| i | 1410 | 4.8% |
| a | 1357 | 4.6% |
| Other values (38) | 9937 |
Common
| Value | Count | Frequency (%) |
| \| | 7591 | |
| (space) | 6699 | |
| . | 2603 | 11.7% |
| 0 | 1658 | 7.4% |
| : | 1569 | 7.0% |
| 2 | 1138 | 5.1% |
| 8 | 277 | 1.2% |
| 9 | 275 | 1.2% |
| 1 | 251 | 1.1% |
| 7 | 132 | 0.6% |
| Other values (7) | 131 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51873 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| \| | 7591 | 14.6% |
| (space) | 6699 | 12.9% |
| e | 2799 | 5.4% |
| . | 2603 | 5.0% |
| r | 2373 | 4.6% |
| y | 2262 | 4.4% |
| t | 2241 | 4.3% |
| o | 2076 | 4.0% |
| b | 1743 | 3.4% |
| D | 1738 | 3.4% |
| Other values (54) | 19748 |
None
| Value | Count | Frequency (%) |
| é | 1 |
associatedTaxa
Text
Missing 
| Distinct | 373 |
|---|---|
| Distinct (%) | 98.4% |
| Missing | 18487 |
| Missing (%) | 98.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 131 |
|---|---|
| Median length | 10 |
| Mean length | 13.77572559 |
| Min length | 10 |
Unique
| Unique | 371 |
|---|---|
| Unique (%) | 97.9% |
Sample
| 1st row | ENT.013766 |
|---|---|
| 2nd row | offspring: MAM.015755 |
| 3rd row | parent: MAM.015754 |
| 4th row | MAM.001438 |
| 5th row | MAM.004953 |
| Value | Count | Frequency (%) |
| part | 39 | 6.9% |
| same | 36 | 6.3% |
| specimen | 36 | 6.3% |
| of | 36 | 6.3% |
| other | 8 | 1.4% |
| parent | 7 | 1.2% |
| mam.012670 | 6 | 1.1% |
| skeleton | 3 | 0.5% |
| mam.013246\|part | 3 | 0.5% |
| mam.013247\|part | 3 | 0.5% |
| Other values (381) | 392 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 884 | |
| M | 775 | |
| . | 402 | 7.7% |
| A | 391 | 7.5% |
| 1 | 344 | 6.6% |
| 3 | 195 | 3.7% |
| (space) | 190 | 3.6% |
| 9 | 173 | 3.3% |
| 2 | 166 | 3.2% |
| 5 | 148 | 2.8% |
| Other values (34) | 1553 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2394 | |
| Uppercase Letter | 1206 | |
| Lowercase Letter | 902 | 17.3% |
| Other Punctuation | 510 | 9.8% |
| Space Separator | 190 | 3.6% |
| Math Symbol | 19 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 130 | |
| p | 100 | |
| a | 98 | |
| s | 85 | |
| r | 72 | |
| m | 72 | |
| t | 70 | |
| n | 59 | |
| o | 54 | |
| f | 50 | 5.5% |
| Other values (7) | 112 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 775 | |
| A | 391 | |
| H | 7 | 0.6% |
| E | 6 | 0.5% |
| X | 5 | 0.4% |
| Y | 5 | 0.4% |
| P | 5 | 0.4% |
| R | 5 | 0.4% |
| T | 3 | 0.2% |
| S | 2 | 0.2% |
| Other values (2) | 2 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 884 | |
| 1 | 344 | 14.4% |
| 3 | 195 | 8.1% |
| 9 | 173 | 7.2% |
| 2 | 166 | 6.9% |
| 5 | 148 | 6.2% |
| 4 | 133 | 5.6% |
| 6 | 124 | 5.2% |
| 8 | 114 | 4.8% |
| 7 | 113 | 4.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 402 | |
| : | 75 | 14.7% |
| ? | 33 | 6.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 190 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3113 | |
| Latin | 2108 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 775 | |
| A | 391 | |
| e | 130 | 6.2% |
| p | 100 | 4.7% |
| a | 98 | 4.6% |
| s | 85 | 4.0% |
| r | 72 | 3.4% |
| m | 72 | 3.4% |
| t | 70 | 3.3% |
| n | 59 | 2.8% |
| Other values (19) | 256 | 12.1% |
Common
| Value | Count | Frequency (%) |
| 0 | 884 | |
| . | 402 | |
| 1 | 344 | 11.1% |
| 3 | 195 | 6.3% |
| (space) | 190 | 6.1% |
| 9 | 173 | 5.6% |
| 2 | 166 | 5.3% |
| 5 | 148 | 4.8% |
| 4 | 133 | 4.3% |
| 6 | 124 | 4.0% |
| Other values (5) | 354 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5221 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 884 | |
| M | 775 | |
| . | 402 | 7.7% |
| A | 391 | 7.5% |
| 1 | 344 | 6.6% |
| 3 | 195 | 3.7% |
| (space) | 190 | 3.6% |
| 9 | 173 | 3.3% |
| 2 | 166 | 3.2% |
| 5 | 148 | 2.8% |
| Other values (34) | 1553 |
Missing 
| Distinct | 6197 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 12652 |
| Missing (%) | 67.1% |
| Memory size | 147.5 KiB |
Length
| Max length | 224 |
|---|---|
| Median length | 124 |
| Mean length | 20.11812037 |
| Min length | 3 |
Unique
| Unique | 6180 |
|---|---|
| Unique (%) | 99.5% |
Sample
| 1st row | Osteo 12753 (MAM.O.12753) |
|---|---|
| 2nd row | Osteo 2583 (MAM.O.02583) |
| 3rd row | Osteo 3875 (MAM.O.03875) |
| 4th row | VP.061504 |
| 5th row | UAM 112553 |
| Value | Count | Frequency (%) |
| osteo | 4413 | |
| m | 6 | < 0.1% |
| dcm | 5 | < 0.1% |
| uam | 5 | < 0.1% |
| 14305 | 2 | < 0.1% |
| 13629 | 2 | < 0.1% |
| 9529 | 2 | < 0.1% |
| 13506 | 2 | < 0.1% |
| 54886 | 2 | < 0.1% |
| 13739 | 2 | < 0.1% |
| Other values (10870) | 11001 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 10298 | 8.2% |
| M | 10211 | 8.2% |
| (space) | 9228 | 7.4% |
| O | 9191 | 7.4% |
| 1 | 8952 | 7.2% |
| 0 | 8355 | 6.7% |
| 3 | 6022 | 4.8% |
| 4 | 5451 | 4.4% |
| 2 | 5128 | 4.1% |
| A | 5092 | 4.1% |
| Other values (45) | 47086 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 51869 | |
| Uppercase Letter | 25108 | |
| Lowercase Letter | 18707 | 15.0% |
| Other Punctuation | 10738 | 8.6% |
| Space Separator | 9228 | 7.4% |
| Open Punctuation | 4595 | 3.7% |
| Close Punctuation | 4593 | 3.7% |
| Dash Punctuation | 176 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 10211 | |
| O | 9191 | |
| A | 5092 | |
| P | 463 | 1.8% |
| R | 100 | 0.4% |
| C | 9 | < 0.1% |
| D | 6 | < 0.1% |
| S | 6 | < 0.1% |
| Z | 6 | < 0.1% |
| U | 5 | < 0.1% |
| Other values (10) | 19 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4601 | |
| s | 4598 | |
| o | 4597 | |
| t | 4597 | |
| m | 147 | 0.8% |
| a | 83 | 0.4% |
| p | 72 | 0.4% |
| l | 2 | < 0.1% |
| r | 2 | < 0.1% |
| c | 2 | < 0.1% |
| Other values (6) | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8952 | |
| 0 | 8355 | |
| 3 | 6022 | |
| 4 | 5451 | |
| 2 | 5128 | |
| 5 | 4090 | |
| 9 | 3811 | |
| 6 | 3503 | 6.8% |
| 7 | 3393 | 6.5% |
| 8 | 3164 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10298 | |
| ; | 436 | 4.1% |
| " | 2 | < 0.1% |
| ? | 1 | < 0.1% |
| / | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9228 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4595 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4593 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 176 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 81199 | |
| Latin | 43815 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 10211 | |
| O | 9191 | |
| A | 5092 | |
| e | 4601 | |
| s | 4598 | |
| o | 4597 | |
| t | 4597 | |
| P | 463 | 1.1% |
| m | 147 | 0.3% |
| R | 100 | 0.2% |
| Other values (26) | 218 | 0.5% |
Common
| Value | Count | Frequency (%) |
| . | 10298 | |
| (space) | 9228 | |
| 1 | 8952 | |
| 0 | 8355 | |
| 3 | 6022 | |
| 4 | 5451 | 6.7% |
| 2 | 5128 | 6.3% |
| ( | 4595 | 5.7% |
| ) | 4593 | 5.7% |
| 5 | 4090 | 5.0% |
| Other values (9) | 14487 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 125014 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 10298 | 8.2% |
| M | 10211 | 8.2% |
| (space) | 9228 | 7.4% |
| O | 9191 | 7.4% |
| 1 | 8952 | 7.2% |
| 0 | 8355 | 6.7% |
| 3 | 6022 | 4.8% |
| 4 | 5451 | 4.4% |
| 2 | 5128 | 4.1% |
| A | 5092 | 4.1% |
| Other values (45) | 47086 |
| Distinct | 18842 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 654 |
|---|---|
| Median length | 580 |
| Mean length | 69.48706668 |
| Min length | 13 |
Unique
| Unique | 18818 |
|---|---|
| Unique (%) | 99.7% |
Sample
| 1st row | MAM number 17903; female; personal specimen number MFH 162; testes 5 x 2 mm |
|---|---|
| 2nd row | MAM number 17889; female |
| 3rd row | MAM number 17897; male |
| 4th row | MAM number 17895; male |
| 5th row | MAM number 17888; female |
| Value | Count | Frequency (%) |
| number | 29739 | 16.2% |
| mam | 18873 | 10.3% |
| original | 6652 | 3.6% |
| catalog | 6652 | 3.6% |
| male | 5021 | 2.7% |
| osteo | 4618 | 2.5% |
| specimen | 4419 | 2.4% |
| personal | 4201 | 2.3% |
| female | 4185 | 2.3% |
| accn=ypm.12236 | 2399 | 1.3% |
| Other values (25895) | 96369 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 164262 | 12.5% |
| e | 84610 | 6.5% |
| n | 72932 | 5.6% |
| a | 67392 | 5.1% |
| M | 58097 | 4.4% |
| m | 52492 | 4.0% |
| r | 52119 | 4.0% |
| c | 44063 | 3.4% |
| 1 | 42870 | 3.3% |
| o | 40086 | 3.1% |
| Other values (74) | 632020 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 689220 | |
| Decimal Number | 220996 | 16.9% |
| Space Separator | 164262 | 12.5% |
| Uppercase Letter | 139813 | 10.7% |
| Other Punctuation | 71758 | 5.5% |
| Math Symbol | 13316 | 1.0% |
| Open Punctuation | 5176 | 0.4% |
| Close Punctuation | 5170 | 0.4% |
| Dash Punctuation | 1232 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 84610 | |
| n | 72932 | |
| a | 67392 | |
| m | 52492 | 7.6% |
| r | 52119 | 7.6% |
| c | 44063 | 6.4% |
| o | 40086 | 5.8% |
| l | 39877 | 5.8% |
| u | 37915 | 5.5% |
| b | 35664 | 5.2% |
| Other values (16) | 162070 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 58097 | |
| A | 29360 | |
| P | 10732 | 7.7% |
| O | 9348 | 6.7% |
| Y | 8914 | 6.4% |
| V | 3906 | 2.8% |
| Z | 3819 | 2.7% |
| R | 3460 | 2.5% |
| S | 2466 | 1.8% |
| B | 1899 | 1.4% |
| Other values (16) | 7812 | 5.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 42870 | |
| 0 | 35267 | |
| 2 | 23803 | |
| 3 | 21594 | |
| 4 | 21563 | |
| 6 | 20089 | |
| 5 | 16601 | 7.5% |
| 7 | 14226 | 6.4% |
| 9 | 13052 | 5.9% |
| 8 | 11931 | 5.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 39221 | |
| . | 27314 | |
| , | 4064 | 5.7% |
| : | 788 | 1.1% |
| / | 142 | 0.2% |
| ? | 123 | 0.2% |
| " | 56 | 0.1% |
| ' | 32 | < 0.1% |
| & | 18 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 13301 | |
| + | 10 | 0.1% |
| ± | 3 | < 0.1% |
| ~ | 1 | < 0.1% |
| > | 1 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5170 | |
| [ | 5 | 0.1% |
| { | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5164 | |
| ] | 5 | 0.1% |
| } | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 164262 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1232 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 829033 | |
| Common | 481910 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 84610 | 10.2% |
| n | 72932 | 8.8% |
| a | 67392 | 8.1% |
| M | 58097 | 7.0% |
| m | 52492 | 6.3% |
| r | 52119 | 6.3% |
| c | 44063 | 5.3% |
| o | 40086 | 4.8% |
| l | 39877 | 4.8% |
| u | 37915 | 4.6% |
| Other values (42) | 279450 |
Common
| Value | Count | Frequency (%) |
| (space) | 164262 | |
| 1 | 42870 | 8.9% |
| ; | 39221 | 8.1% |
| 0 | 35267 | 7.3% |
| . | 27314 | 5.7% |
| 2 | 23803 | 4.9% |
| 3 | 21594 | 4.5% |
| 4 | 21563 | 4.5% |
| 6 | 20089 | 4.2% |
| 5 | 16601 | 3.4% |
| Other values (22) | 69326 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1310940 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 164262 | 12.5% |
| e | 84610 | 6.5% |
| n | 72932 | 5.6% |
| a | 67392 | 5.1% |
| M | 58097 | 4.4% |
| m | 52492 | 4.0% |
| r | 52119 | 4.0% |
| c | 44063 | 3.4% |
| 1 | 42870 | 3.3% |
| o | 40086 | 3.1% |
| Other values (73) | 632017 |
None
| Value | Count | Frequency (%) |
| ± | 3 |
| Distinct | 3180 |
|---|---|
| Distinct (%) | 17.0% |
| Missing | 152 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 135 |
|---|---|
| Median length | 105 |
| Mean length | 29.92679278 |
| Min length | 3 |
Unique
| Unique | 1569 |
|---|---|
| Unique (%) | 8.4% |
Sample
| 1st row | Tamias striatus fisheri |
|---|---|
| 2nd row | Peromyscus leucopus noveboracensis |
| 3rd row | Peromyscus leucopus noveboracensis |
| 4th row | Peromyscus leucopus noveboracensis |
| 5th row | Peromyscus leucopus noveboracensis |
| Value | Count | Frequency (%) |
| peromyscus | 1837 | 3.4% |
| gapperi | 1530 | 2.8% |
| cinereus | 1460 | 2.7% |
| brevicauda | 1361 | 2.5% |
| sorex | 1193 | 2.2% |
| blarina | 976 | 1.8% |
| maniculatus | 919 | 1.7% |
| zibethicus | 906 | 1.7% |
| leucopus | 836 | 1.6% |
| talpoides | 759 | 1.4% |
| Other values (3630) | 42002 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 54787 | 9.8% |
| i | 49194 | 8.8% |
| a | 47829 | 8.5% |
| u | 39741 | 7.1% |
| e | 39634 | 7.1% |
| r | 35507 | 6.3% |
| (space) | 35065 | 6.3% |
| o | 33086 | 5.9% |
| n | 28079 | 5.0% |
| c | 25884 | 4.6% |
| Other values (45) | 171244 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 491072 | |
| Space Separator | 35065 | 6.3% |
| Uppercase Letter | 26308 | 4.7% |
| Math Symbol | 7591 | 1.4% |
| Other Punctuation | 10 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 54787 | |
| i | 49194 | |
| a | 47829 | |
| u | 39741 | 8.1% |
| e | 39634 | 8.1% |
| r | 35507 | 7.2% |
| o | 33086 | 6.7% |
| n | 28079 | 5.7% |
| c | 25884 | 5.3% |
| l | 22447 | 4.6% |
| Other values (16) | 114884 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3888 | |
| C | 3681 | |
| M | 3187 | |
| S | 2683 | |
| B | 1796 | 6.8% |
| T | 1608 | 6.1% |
| O | 1561 | 5.9% |
| N | 1123 | 4.3% |
| A | 925 | 3.5% |
| L | 925 | 3.5% |
| Other values (14) | 4931 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8 | |
| ? | 2 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 35065 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 7591 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 517380 | |
| Common | 42670 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 54787 | 10.6% |
| i | 49194 | 9.5% |
| a | 47829 | 9.2% |
| u | 39741 | 7.7% |
| e | 39634 | 7.7% |
| r | 35507 | 6.9% |
| o | 33086 | 6.4% |
| n | 28079 | 5.4% |
| c | 25884 | 5.0% |
| l | 22447 | 4.3% |
| Other values (40) | 141192 |
Common
| Value | Count | Frequency (%) |
| (space) | 35065 | |
| \| | 7591 | 17.8% |
| . | 8 | < 0.1% |
| - | 4 | < 0.1% |
| ? | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 560050 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 54787 | 9.8% |
| i | 49194 | 8.8% |
| a | 47829 | 8.5% |
| u | 39741 | 7.1% |
| e | 39634 | 7.1% |
| r | 35507 | 6.3% |
| (space) | 35065 | 6.3% |
| o | 33086 | 5.9% |
| n | 28079 | 5.0% |
| c | 25884 | 4.6% |
| Other values (45) | 171244 |
fieldNumber
Text
Missing 
| Distinct | 5159 |
|---|---|
| Distinct (%) | 70.6% |
| Missing | 11555 |
| Missing (%) | 61.2% |
| Memory size | 147.5 KiB |
Length
| Max length | 22 |
|---|---|
| Median length | 16 |
| Mean length | 4.113664341 |
| Min length | 1 |
Unique
| Unique | 4249 |
|---|---|
| Unique (%) | 58.1% |
Sample
| 1st row | 14251 |
|---|---|
| 2nd row | P5 |
| 3rd row | P14 |
| 4th row | P12 |
| 5th row | P4 |
| Value | Count | Frequency (%) |
| f | 452 | 5.3% |
| r | 169 | 2.0% |
| l | 162 | 1.9% |
| mcz | 50 | 0.6% |
| 2 | 44 | 0.5% |
| 3 | 43 | 0.5% |
| 1 | 42 | 0.5% |
| 5 | 38 | 0.4% |
| jas | 32 | 0.4% |
| 4 | 31 | 0.4% |
| Other values (4656) | 7419 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4503 | |
| 3 | 2724 | |
| 4 | 2723 | |
| 2 | 2604 | |
| 0 | 2480 | |
| 8 | 2138 | 7.1% |
| 9 | 1836 | 6.1% |
| 7 | 1835 | 6.1% |
| 5 | 1834 | 6.1% |
| 6 | 1742 | 5.8% |
| Other values (58) | 5656 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 24419 | |
| Uppercase Letter | 3351 | 11.1% |
| Space Separator | 1171 | 3.9% |
| Dash Punctuation | 829 | 2.8% |
| Lowercase Letter | 148 | 0.5% |
| Open Punctuation | 53 | 0.2% |
| Close Punctuation | 53 | 0.2% |
| Other Punctuation | 51 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 853 | |
| R | 343 | |
| Q | 331 | 9.9% |
| A | 229 | 6.8% |
| M | 198 | 5.9% |
| L | 178 | 5.3% |
| B | 172 | 5.1% |
| Z | 145 | 4.3% |
| C | 133 | 4.0% |
| P | 131 | 3.9% |
| Other values (16) | 638 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 24 | |
| m | 17 | |
| l | 16 | |
| e | 13 | |
| o | 12 | 8.1% |
| t | 10 | 6.8% |
| r | 8 | 5.4% |
| i | 7 | 4.7% |
| n | 6 | 4.1% |
| p | 5 | 3.4% |
| Other values (10) | 30 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4503 | |
| 3 | 2724 | |
| 4 | 2723 | |
| 2 | 2604 | |
| 0 | 2480 | |
| 8 | 2138 | |
| 9 | 1836 | |
| 7 | 1835 | |
| 5 | 1834 | |
| 6 | 1742 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 29 | |
| ? | 9 | 17.6% |
| / | 7 | 13.7% |
| # | 3 | 5.9% |
| ; | 2 | 3.9% |
| : | 1 | 2.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 52 | |
| ( | 1 | 1.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 52 | |
| ) | 1 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1171 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 829 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26576 | |
| Latin | 3499 | 11.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 853 | |
| R | 343 | |
| Q | 331 | 9.5% |
| A | 229 | 6.5% |
| M | 198 | 5.7% |
| L | 178 | 5.1% |
| B | 172 | 4.9% |
| Z | 145 | 4.1% |
| C | 133 | 3.8% |
| P | 131 | 3.7% |
| Other values (36) | 786 |
Common
| Value | Count | Frequency (%) |
| 1 | 4503 | |
| 3 | 2724 | |
| 4 | 2723 | |
| 2 | 2604 | |
| 0 | 2480 | |
| 8 | 2138 | |
| 9 | 1836 | |
| 7 | 1835 | |
| 5 | 1834 | |
| 6 | 1742 | 6.6% |
| Other values (12) | 2157 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30075 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4503 | |
| 3 | 2724 | |
| 4 | 2723 | |
| 2 | 2604 | |
| 0 | 2480 | |
| 8 | 2138 | 7.1% |
| 9 | 1836 | 6.1% |
| 7 | 1835 | 6.1% |
| 5 | 1834 | 6.1% |
| 6 | 1742 | 5.8% |
| Other values (58) | 5656 |
eventDate
Text
Missing 
| Distinct | 3828 |
|---|---|
| Distinct (%) | 31.1% |
| Missing | 6567 |
| Missing (%) | 34.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 9.544759737 |
| Min length | 4 |
Unique
| Unique | 2041 |
|---|---|
| Unique (%) | 16.6% |
Sample
| 1st row | 2024-08-15 |
|---|---|
| 2nd row | 2023-12-01 |
| 3rd row | 2023-12-28 |
| 4th row | 2023-12-20 |
| 5th row | 2023-11-30 |
| Value | Count | Frequency (%) |
| 2012-07-18 | 178 | 1.4% |
| 2012-07-15 | 170 | 1.4% |
| 1959 | 163 | 1.3% |
| 2012-07-16 | 150 | 1.2% |
| 2012-07-24 | 144 | 1.2% |
| 2013-08-02 | 109 | 0.9% |
| 2020-10-07 | 108 | 0.9% |
| 2020-10-14 | 100 | 0.8% |
| 2020-10-15 | 96 | 0.8% |
| 2020-10-08 | 96 | 0.8% |
| Other values (3818) | 10985 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 22585 | |
| 0 | 21802 | |
| 1 | 20655 | |
| 2 | 12689 | |
| 9 | 10314 | |
| 7 | 5694 | 4.9% |
| 6 | 5638 | 4.8% |
| 5 | 5198 | 4.4% |
| 3 | 4638 | 4.0% |
| 8 | 4585 | 3.9% |
| Other values (2) | 3593 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 94718 | |
| Dash Punctuation | 22585 | 19.2% |
| Other Punctuation | 88 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 21802 | |
| 1 | 20655 | |
| 2 | 12689 | |
| 9 | 10314 | |
| 7 | 5694 | 6.0% |
| 6 | 5638 | 6.0% |
| 5 | 5198 | 5.5% |
| 3 | 4638 | 4.9% |
| 8 | 4585 | 4.8% |
| 4 | 3505 | 3.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 22585 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 88 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 117391 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 22585 | |
| 0 | 21802 | |
| 1 | 20655 | |
| 2 | 12689 | |
| 9 | 10314 | |
| 7 | 5694 | 4.9% |
| 6 | 5638 | 4.8% |
| 5 | 5198 | 4.4% |
| 3 | 4638 | 4.0% |
| 8 | 4585 | 3.9% |
| Other values (2) | 3593 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117391 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 22585 | |
| 0 | 21802 | |
| 1 | 20655 | |
| 2 | 12689 | |
| 9 | 10314 | |
| 7 | 5694 | 4.9% |
| 6 | 5638 | 4.8% |
| 5 | 5198 | 4.4% |
| 3 | 4638 | 4.0% |
| 8 | 4585 | 3.9% |
| Other values (2) | 3593 | 3.1% |
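The eventDate statistics above point to mixed precision: the length ranges from 4 to 21 characters, the common values include both `1959` and `2012-07-18`, and `/` occurs 88 times. A minimal sketch of how such values could be bucketed by shape follows. It assumes that the `/` separates the two ends of a date interval, which the report itself does not confirm, and the function name `classify_event_date` is hypothetical.

```python
import re

# Shapes suggested by the profile: bare year, year-month, full date.
_PATTERNS = [
    ("year", re.compile(r"\d{4}")),
    ("month", re.compile(r"\d{4}-\d{2}")),
    ("day", re.compile(r"\d{4}-\d{2}-\d{2}")),
]
_SIMPLE = {"year", "month", "day"}


def classify_event_date(value):
    """Return a coarse label for the shape of an eventDate string."""
    if value is None or value == "":
        return "missing"
    if "/" in value:
        # Assumption: a slash separates the start and end of an interval.
        parts = value.split("/")
        if len(parts) == 2 and all(classify_event_date(p) in _SIMPLE for p in parts):
            return "interval"
        return "other"
    for name, pattern in _PATTERNS:
        if pattern.fullmatch(value):
            return name
    return "other"
```

Counting the labels over the whole column (for example with `collections.Counter`) would show how many records carry only a year, which is useful before converting the column to a date type.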
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 7901 |
| Missing (%) | 41.9% |
| Memory size | 147.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.816598267 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 228 |
|---|---|
| 2nd row | 335 |
| 3rd row | 362 |
| 4th row | 354 |
| 5th row | 334 |
| Value | Count | Frequency (%) |
| 200 | 253 | 2.3% |
| 197 | 230 | 2.1% |
| 198 | 219 | 2.0% |
| 206 | 207 | 1.9% |
| 214 | 178 | 1.6% |
| 190 | 131 | 1.2% |
| 194 | 123 | 1.1% |
| 281 | 121 | 1.1% |
| 282 | 113 | 1.0% |
| 288 | 113 | 1.0% |
| Other values (356) | 9277 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6647 | |
| 1 | 5594 | |
| 3 | 3147 | |
| 9 | 2617 | 8.5% |
| 0 | 2608 | 8.4% |
| 8 | 2580 | 8.4% |
| 7 | 2170 | 7.0% |
| 4 | 1963 | 6.4% |
| 5 | 1825 | 5.9% |
| 6 | 1733 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 30884 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6647 | |
| 1 | 5594 | |
| 3 | 3147 | |
| 9 | 2617 | 8.5% |
| 0 | 2608 | 8.4% |
| 8 | 2580 | 8.4% |
| 7 | 2170 | 7.0% |
| 4 | 1963 | 6.4% |
| 5 | 1825 | 5.9% |
| 6 | 1733 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 30884 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6647 | |
| 1 | 5594 | |
| 3 | 3147 | |
| 9 | 2617 | 8.5% |
| 0 | 2608 | 8.4% |
| 8 | 2580 | 8.4% |
| 7 | 2170 | 7.0% |
| 4 | 1963 | 6.4% |
| 5 | 1825 | 5.9% |
| 6 | 1733 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30884 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6647 | |
| 1 | 5594 | |
| 3 | 3147 | |
| 9 | 2617 | 8.5% |
| 0 | 2608 | 8.4% |
| 8 | 2580 | 8.4% |
| 7 | 2170 | 7.0% |
| 4 | 1963 | 6.4% |
| 5 | 1825 | 5.9% |
| 6 | 1733 | 5.6% |
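The startDayOfYear sample rows (228, 335, 362, 354, 334) line up with the eventDate sample rows (2024-08-15, 2023-12-01, 2023-12-28, 2023-12-20, 2023-11-30). A short sketch of that relationship, assuming the field is the ordinal day within the calendar year of a full ISO date; the helper name `day_of_year` is hypothetical.

```python
from datetime import date


def day_of_year(iso_date: str) -> int:
    """Return the 1-based day of the year for a YYYY-MM-DD string."""
    return date.fromisoformat(iso_date).timetuple().tm_yday
```

A check like this only applies to records whose eventDate is a full date; year-only values such as `1959` have no day of year, which may be why startDayOfYear has more missing values (7901) than eventDate (6567).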
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 3.3% |
| Missing | 7901 |
| Missing (%) | 41.9% |
| Memory size | 147.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.816415869 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 228 |
|---|---|
| 2nd row | 335 |
| 3rd row | 362 |
| 4th row | 354 |
| 5th row | 334 |
| Value | Count | Frequency (%) |
| 200 | 253 | 2.3% |
| 197 | 230 | 2.1% |
| 198 | 219 | 2.0% |
| 206 | 207 | 1.9% |
| 214 | 178 | 1.6% |
| 190 | 131 | 1.2% |
| 194 | 123 | 1.1% |
| 281 | 121 | 1.1% |
| 282 | 113 | 1.0% |
| 288 | 113 | 1.0% |
| Other values (356) | 9277 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 6646 | |
| 1 | 5597 | |
| 3 | 3140 | |
| 9 | 2609 | 8.4% |
| 0 | 2608 | 8.4% |
| 8 | 2544 | 8.2% |
| 7 | 2202 | 7.1% |
| 4 | 1921 | 6.2% |
| 5 | 1856 | 6.0% |
| 6 | 1759 | 5.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 30882 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6646 | |
| 1 | 5597 | |
| 3 | 3140 | |
| 9 | 2609 | 8.4% |
| 0 | 2608 | 8.4% |
| 8 | 2544 | 8.2% |
| 7 | 2202 | 7.1% |
| 4 | 1921 | 6.2% |
| 5 | 1856 | 6.0% |
| 6 | 1759 | 5.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 30882 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 6646 | |
| 1 | 5597 | |
| 3 | 3140 | |
| 9 | 2609 | 8.4% |
| 0 | 2608 | 8.4% |
| 8 | 2544 | 8.2% |
| 7 | 2202 | 7.1% |
| 4 | 1921 | 6.2% |
| 5 | 1856 | 6.0% |
| 6 | 1759 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30882 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 6646 | |
| 1 | 5597 | |
| 3 | 3140 | |
| 9 | 2609 | 8.4% |
| 0 | 2608 | 8.4% |
| 8 | 2544 | 8.2% |
| 7 | 2202 | 7.1% |
| 4 | 1921 | 6.2% |
| 5 | 1856 | 6.0% |
| 6 | 1759 | 5.7% |
year
Text
Missing 
| Distinct | 156 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 6572 |
| Missing (%) | 34.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2024 |
|---|---|
| 2nd row | 2023 |
| 3rd row | 2023 |
| 4th row | 2023 |
| 5th row | 2023 |
| Value | Count | Frequency (%) |
| 2013 | 864 | 7.0% |
| 2012 | 821 | 6.7% |
| 2020 | 800 | 6.5% |
| 2014 | 728 | 5.9% |
| 1965 | 712 | 5.8% |
| 1962 | 340 | 2.8% |
| 1956 | 325 | 2.6% |
| 1964 | 288 | 2.3% |
| 1959 | 284 | 2.3% |
| 1952 | 274 | 2.2% |
| Other values (146) | 6858 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 11749 | |
| 9 | 8330 | |
| 2 | 7669 | |
| 0 | 6637 | |
| 5 | 3388 | 6.9% |
| 6 | 3295 | 6.7% |
| 3 | 2758 | 5.6% |
| 7 | 1878 | 3.8% |
| 4 | 1818 | 3.7% |
| 8 | 1654 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 49176 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 11749 | |
| 9 | 8330 | |
| 2 | 7669 | |
| 0 | 6637 | |
| 5 | 3388 | 6.9% |
| 6 | 3295 | 6.7% |
| 3 | 2758 | 5.6% |
| 7 | 1878 | 3.8% |
| 4 | 1818 | 3.7% |
| 8 | 1654 | 3.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 49176 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 11749 | |
| 9 | 8330 | |
| 2 | 7669 | |
| 0 | 6637 | |
| 5 | 3388 | 6.9% |
| 6 | 3295 | 6.7% |
| 3 | 2758 | 5.6% |
| 7 | 1878 | 3.8% |
| 4 | 1818 | 3.7% |
| 8 | 1654 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49176 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 11749 | |
| 9 | 8330 | |
| 2 | 7669 | |
| 0 | 6637 | |
| 5 | 3388 | 6.9% |
| 6 | 3295 | 6.7% |
| 3 | 2758 | 5.6% |
| 7 | 1878 | 3.8% |
| 4 | 1818 | 3.7% |
| 8 | 1654 | 3.4% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7472 |
| Missing (%) | 39.6% |
| Memory size | 147.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.204318062 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 8 |
|---|---|
| 2nd row | 12 |
| 3rd row | 12 |
| 4th row | 12 |
| 5th row | 11 |
| Value | Count | Frequency (%) |
| 7 | 2621 | |
| 8 | 1678 | |
| 10 | 1318 | |
| 6 | 1172 | |
| 9 | 828 | 7.3% |
| 1 | 718 | 6.3% |
| 11 | 605 | 5.3% |
| 5 | 553 | 4.9% |
| 3 | 508 | 4.5% |
| 4 | 496 | 4.4% |
| Other values (2) | 897 | 7.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3651 | |
| 7 | 2621 | |
| 8 | 1678 | |
| 0 | 1318 | 9.6% |
| 6 | 1172 | 8.5% |
| 2 | 897 | 6.5% |
| 9 | 828 | 6.0% |
| 5 | 553 | 4.0% |
| 3 | 508 | 3.7% |
| 4 | 496 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13722 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3651 | |
| 7 | 2621 | |
| 8 | 1678 | |
| 0 | 1318 | 9.6% |
| 6 | 1172 | 8.5% |
| 2 | 897 | 6.5% |
| 9 | 828 | 6.0% |
| 5 | 553 | 4.0% |
| 3 | 508 | 3.7% |
| 4 | 496 | 3.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13722 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3651 | |
| 7 | 2621 | |
| 8 | 1678 | |
| 0 | 1318 | 9.6% |
| 6 | 1172 | 8.5% |
| 2 | 897 | 6.5% |
| 9 | 828 | 6.0% |
| 5 | 553 | 4.0% |
| 3 | 508 | 3.7% |
| 4 | 496 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13722 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3651 | |
| 7 | 2621 | |
| 8 | 1678 | |
| 0 | 1318 | 9.6% |
| 6 | 1172 | 8.5% |
| 2 | 897 | 6.5% |
| 9 | 828 | 6.0% |
| 5 | 553 | 4.0% |
| 3 | 508 | 3.7% |
| 4 | 496 | 3.6% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 7989 |
| Missing (%) | 42.3% |
| Memory size | 147.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.67812816 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 15 |
|---|---|
| 2nd row | 1 |
| 3rd row | 28 |
| 4th row | 20 |
| 5th row | 30 |
| Value | Count | Frequency (%) |
| 18 | 551 | 5.1% |
| 15 | 518 | 4.8% |
| 7 | 470 | 4.3% |
| 16 | 465 | 4.3% |
| 8 | 445 | 4.1% |
| 9 | 433 | 4.0% |
| 24 | 428 | 3.9% |
| 2 | 425 | 3.9% |
| 19 | 410 | 3.8% |
| 4 | 386 | 3.5% |
| Other values (21) | 6346 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5053 | |
| 2 | 3974 | |
| 3 | 1359 | 7.4% |
| 8 | 1236 | 6.8% |
| 4 | 1179 | 6.5% |
| 5 | 1143 | 6.3% |
| 7 | 1122 | 6.1% |
| 6 | 1111 | 6.1% |
| 9 | 1085 | 5.9% |
| 0 | 991 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18253 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5053 | |
| 2 | 3974 | |
| 3 | 1359 | 7.4% |
| 8 | 1236 | 6.8% |
| 4 | 1179 | 6.5% |
| 5 | 1143 | 6.3% |
| 7 | 1122 | 6.1% |
| 6 | 1111 | 6.1% |
| 9 | 1085 | 5.9% |
| 0 | 991 | 5.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18253 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5053 | |
| 2 | 3974 | |
| 3 | 1359 | 7.4% |
| 8 | 1236 | 6.8% |
| 4 | 1179 | 6.5% |
| 5 | 1143 | 6.3% |
| 7 | 1122 | 6.1% |
| 6 | 1111 | 6.1% |
| 9 | 1085 | 5.9% |
| 0 | 991 | 5.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18253 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5053 | |
| 2 | 3974 | |
| 3 | 1359 | 7.4% |
| 8 | 1236 | 6.8% |
| 4 | 1179 | 6.5% |
| 5 | 1143 | 6.3% |
| 7 | 1122 | 6.1% |
| 6 | 1111 | 6.1% |
| 9 | 1085 | 5.9% |
| 0 | 991 | 5.4% |
habitat
Text
Missing 
| Distinct | 49 |
|---|---|
| Distinct (%) | 38.6% |
| Missing | 18739 |
| Missing (%) | 99.3% |
| Memory size | 147.5 KiB |
Length
| Max length | 185 |
|---|---|
| Median length | 88 |
| Mean length | 16.97637795 |
| Min length | 5 |
Unique
| Unique | 38 |
|---|---|
| Unique (%) | 29.9% |
Sample
| 1st row | Urban |
|---|---|
| 2nd row | Urban |
| 3rd row | Urban |
| 4th row | Urban |
| 5th row | Urban |
| Value | Count | Frequency (%) |
| urban | 50 | 14.2% |
| in | 21 | 5.9% |
| suburban | 18 | 5.1% |
| forest | 10 | 2.8% |
| by | 8 | 2.3% |
| pine | 7 | 2.0% |
| open | 6 | 1.7% |
| of | 6 | 1.7% |
| ponderosa | 6 | 1.7% |
| soil | 5 | 1.4% |
| Other values (132) | 216 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 226 | 10.5% |
| a | 205 | 9.5% |
| n | 189 | 8.8% |
| r | 178 | 8.3% |
| e | 162 | 7.5% |
| o | 131 | 6.1% |
| s | 119 | 5.5% |
| b | 116 | 5.4% |
| i | 105 | 4.9% |
| t | 98 | 4.5% |
| Other values (36) | 627 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1794 | |
| Space Separator | 226 | 10.5% |
| Uppercase Letter | 100 | 4.6% |
| Other Punctuation | 31 | 1.4% |
| Decimal Number | 3 | 0.1% |
| Dash Punctuation | 2 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 205 | |
| n | 189 | |
| r | 178 | |
| e | 162 | 9.0% |
| o | 131 | 7.3% |
| s | 119 | 6.6% |
| b | 116 | 6.5% |
| i | 105 | 5.9% |
| t | 98 | 5.5% |
| d | 81 | 4.5% |
| Other values (14) | 410 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 50 | |
| S | 19 | 19.0% |
| P | 11 | 11.0% |
| W | 5 | 5.0% |
| R | 3 | 3.0% |
| B | 3 | 3.0% |
| C | 3 | 3.0% |
| E | 2 | 2.0% |
| V | 1 | 1.0% |
| F | 1 | 1.0% |
| Other values (2) | 2 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 20 | |
| . | 4 | 12.9% |
| ; | 3 | 9.7% |
| " | 2 | 6.5% |
| : | 1 | 3.2% |
| ' | 1 | 3.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 1 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 226 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1894 | |
| Common | 262 | 12.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 205 | |
| n | 189 | 10.0% |
| r | 178 | 9.4% |
| e | 162 | 8.6% |
| o | 131 | 6.9% |
| s | 119 | 6.3% |
| b | 116 | 6.1% |
| i | 105 | 5.5% |
| t | 98 | 5.2% |
| d | 81 | 4.3% |
| Other values (26) | 510 |
Common
| Value | Count | Frequency (%) |
| (space) | 226 | |
| , | 20 | 7.6% |
| . | 4 | 1.5% |
| ; | 3 | 1.1% |
| " | 2 | 0.8% |
| - | 2 | 0.8% |
| 0 | 2 | 0.8% |
| : | 1 | 0.4% |
| 1 | 1 | 0.4% |
| ' | 1 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2156 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 226 | 10.5% |
| a | 205 | 9.5% |
| n | 189 | 8.8% |
| r | 178 | 8.3% |
| e | 162 | 7.5% |
| o | 131 | 6.1% |
| s | 119 | 5.5% |
| b | 116 | 5.4% |
| i | 105 | 4.9% |
| t | 98 | 4.5% |
| Other values (36) | 627 |
higherGeography
Text
Missing 
| Distinct | 951 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 3778 |
| Missing (%) | 20.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 74 |
|---|---|
| Median length | 66 |
| Mean length | 40.53313892 |
| Min length | 4 |
Unique
| Unique | 314 |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | North America; USA; Connecticut; New Haven County |
|---|---|
| 2nd row | North America; USA; Connecticut; Middlesex County |
| 3rd row | North America; USA; Connecticut; Middlesex County |
| 4th row | North America; USA; Connecticut; Middlesex County |
| 5th row | North America; USA; Connecticut; Middlesex County |
| Value | Count | Frequency (%) |
| america | 11919 | |
| north | 11535 | |
| usa | 10091 | 11.9% |
| county | 9449 | 11.1% |
| new | 4323 | 5.1% |
| hampshire | 2881 | 3.4% |
| carroll | 2750 | 3.2% |
| africa | 2011 | 2.4% |
| connecticut | 1497 | 1.8% |
| province | 1319 | 1.6% |
| Other values (974) | 27017 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 69704 | 11.4% |
| r | 45359 | 7.4% |
| a | 42864 | 7.0% |
| o | 40987 | 6.7% |
| ; | 38761 | 6.3% |
| e | 35621 | 5.8% |
| i | 34064 | 5.6% |
| t | 32303 | 5.3% |
| n | 28583 | 4.7% |
| A | 26232 | 4.3% |
| Other values (53) | 217086 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 397956 | |
| Uppercase Letter | 105028 | 17.2% |
| Space Separator | 69704 | 11.4% |
| Other Punctuation | 38819 | 6.3% |
| Dash Punctuation | 57 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 45359 | |
| a | 42864 | |
| o | 40987 | |
| e | 35621 | |
| i | 34064 | |
| t | 32303 | 8.1% |
| n | 28583 | 7.2% |
| c | 22577 | 5.7% |
| h | 17759 | 4.5% |
| m | 16806 | 4.2% |
| Other values (20) | 81033 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 26232 | |
| C | 17618 | |
| N | 16467 | |
| S | 12399 | |
| U | 10244 | 9.8% |
| H | 3948 | 3.8% |
| M | 2894 | 2.8% |
| P | 2510 | 2.4% |
| E | 1256 | 1.2% |
| G | 1207 | 1.1% |
| Other values (16) | 10253 | 9.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 38761 | |
| . | 28 | 0.1% |
| ' | 28 | 0.1% |
| & | 1 | < 0.1% |
| , | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 69704 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 57 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 502984 | |
| Common | 108580 | 17.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 45359 | 9.0% |
| a | 42864 | 8.5% |
| o | 40987 | 8.1% |
| e | 35621 | 7.1% |
| i | 34064 | 6.8% |
| t | 32303 | 6.4% |
| n | 28583 | 5.7% |
| A | 26232 | 5.2% |
| c | 22577 | 4.5% |
| h | 17759 | 3.5% |
| Other values (46) | 176635 |
Common
| Value | Count | Frequency (%) |
| (space) | 69704 | |
| ; | 38761 | |
| - | 57 | 0.1% |
| . | 28 | < 0.1% |
| ' | 28 | < 0.1% |
| & | 1 | < 0.1% |
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 611460 | |
| None | 104 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 69704 | 11.4% |
| r | 45359 | 7.4% |
| a | 42864 | 7.0% |
| o | 40987 | 6.7% |
| ; | 38761 | 6.3% |
| e | 35621 | 5.8% |
| i | 34064 | 5.6% |
| t | 32303 | 5.3% |
| n | 28583 | 4.7% |
| A | 26232 | 4.3% |
| Other values (48) | 216982 |
None
| Value | Count | Frequency (%) |
| á | 72 | |
| í | 16 | 15.4% |
| é | 11 | 10.6% |
| ó | 4 | 3.8% |
| Á | 1 | 1.0% |
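The higherGeography sample rows are semicolon-delimited paths such as `North America; USA; Connecticut; New Haven County`, and `;` is by far the most frequent punctuation character in the column. A minimal sketch of splitting such a value into its levels follows. The function name `split_higher_geography` is hypothetical, and the sketch makes no assumption about what each level means, since the number of levels varies between records.

```python
def split_higher_geography(value):
    """Split a semicolon-delimited geography path into trimmed levels."""
    if not value:
        return []
    # Drop empty pieces so stray or trailing semicolons do not create blank levels.
    return [part.strip() for part in value.split(";") if part.strip()]
```

For the first sample row this gives four levels, from continent down to county.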
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3874 |
| Missing (%) | 20.5% |
| Memory size | 147.5 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.49086179 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 11386 | |
| africa | 1991 | 13.3% |
| asia | 648 | 4.3% |
| south_america | 537 | 3.6% |
| europe | 279 | 1.9% |
| oceania | 150 | 1.0% |
| antarctica | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 29427 | |
| R | 25580 | |
| I | 14713 | |
| C | 14066 | |
| E | 12631 | |
| O | 12352 | |
| T | 11925 | |
| H | 11923 | |
| _ | 11923 | |
| M | 11923 | |
| Other values (5) | 15808 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 160348 | |
| Connector Punctuation | 11923 | 6.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 29427 | |
| R | 25580 | |
| I | 14713 | |
| C | 14066 | |
| E | 12631 | |
| O | 12352 | |
| T | 11925 | |
| H | 11923 | |
| M | 11923 | |
| N | 11537 | 7.2% |
| Other values (4) | 4271 | 2.7% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 11923 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 160348 | |
| Common | 11923 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 29427 | |
| R | 25580 | |
| I | 14713 | |
| C | 14066 | |
| E | 12631 | |
| O | 12352 | |
| T | 11925 | |
| H | 11923 | |
| M | 11923 | |
| N | 11537 | 7.2% |
| Other values (4) | 4271 | 2.7% |
Common
| Value | Count | Frequency (%) |
| _ | 11923 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 172271 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 29427 | |
| R | 25580 | |
| I | 14713 | |
| C | 14066 | |
| E | 12631 | |
| O | 12352 | |
| T | 11925 | |
| H | 11923 | |
| _ | 11923 | |
| M | 11923 | |
| Other values (5) | 15808 |
waterBody
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 18739 |
| Missing (%) | 99.3% |
| Memory size | 147.5 KiB |
Length
| Max length | 38 |
|---|---|
| Median length | 29 |
| Mean length | 23.07874016 |
| Min length | 12 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 2.4% |
Sample
| 1st row | Atlantic Ocean; Caribbean Sea |
|---|---|
| 2nd row | Atlantic Ocean; Caribbean Sea |
| 3rd row | Atlantic Ocean; Caribbean Sea |
| 4th row | Atlantic Ocean; Caribbean Sea |
| 5th row | Atlantic Ocean; Caribbean Sea |
| Value | Count | Frequency (%) |
| ocean | 127 | |
| atlantic | 87 | |
| sea | 79 | |
| caribbean | 78 | |
| pacific | 30 | 7.2% |
| indian | 9 | 2.2% |
| arctic | 1 | 0.2% |
| red | 1 | 0.2% |
| gulf | 1 | 0.2% |
| of | 1 | 0.2% |
| Other values (2) | 2 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 490 | |
| n | 312 | |
| (space) | 289 | |
| e | 287 | |
| c | 277 | |
| i | 236 | |
| t | 176 | 6.0% |
| b | 156 | 5.3% |
| O | 127 | 4.3% |
| A | 88 | 3.0% |
| Other values (15) | 493 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2147 | |
| Uppercase Letter | 415 | 14.2% |
| Space Separator | 289 | 9.9% |
| Other Punctuation | 80 | 2.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 490 | |
| n | 312 | |
| e | 287 | |
| c | 277 | |
| i | 236 | |
| t | 176 | 8.2% |
| b | 156 | 7.3% |
| l | 88 | 4.1% |
| r | 80 | 3.7% |
| f | 32 | 1.5% |
| Other values (4) | 13 | 0.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 127 | |
| A | 88 | |
| S | 80 | |
| C | 78 | |
| P | 30 | 7.2% |
| I | 9 | 2.2% |
| R | 1 | 0.2% |
| G | 1 | 0.2% |
| L | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 289 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 80 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2562 | |
| Common | 369 | 12.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 490 | |
| n | 312 | |
| e | 287 | |
| c | 277 | |
| i | 236 | |
| t | 176 | 6.9% |
| b | 156 | 6.1% |
| O | 127 | 5.0% |
| A | 88 | 3.4% |
| l | 88 | 3.4% |
| Other values (13) | 325 |
Common
| Value | Count | Frequency (%) |
| (space) | 289 | |
| ; | 80 | 21.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2931 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 490 | |
| n | 312 | |
| (space) | 289 | |
| e | 287 | |
| c | 277 | |
| i | 236 | |
| t | 176 | 6.0% |
| b | 156 | 5.3% |
| O | 127 | 4.3% |
| A | 88 | 3.0% |
| Other values (15) | 493 |
countryCode
Text
Missing 
| Distinct | 105 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 3974 |
| Missing (%) | 21.1% |
| Memory size | 147.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 17 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 10088 | |
| ca | 686 | 4.6% |
| ke | 667 | 4.5% |
| mx | 578 | 3.9% |
| eg | 430 | 2.9% |
| id | 279 | 1.9% |
| cm | 254 | 1.7% |
| ec | 238 | 1.6% |
| gr | 138 | 0.9% |
| au | 112 | 0.8% |
| Other values (95) | 1422 | 9.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 10251 | |
| S | 10220 | |
| E | 1412 | 4.7% |
| C | 1380 | 4.6% |
| M | 1059 | 3.6% |
| A | 911 | 3.1% |
| G | 811 | 2.7% |
| K | 743 | 2.5% |
| X | 578 | 1.9% |
| I | 432 | 1.5% |
| Other values (16) | 1987 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 29784 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 10251 | |
| S | 10220 | |
| E | 1412 | 4.7% |
| C | 1380 | 4.6% |
| M | 1059 | 3.6% |
| A | 911 | 3.1% |
| G | 811 | 2.7% |
| K | 743 | 2.5% |
| X | 578 | 1.9% |
| I | 432 | 1.5% |
| Other values (16) | 1987 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29784 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 10251 | |
| S | 10220 | |
| E | 1412 | 4.7% |
| C | 1380 | 4.6% |
| M | 1059 | 3.6% |
| A | 911 | 3.1% |
| G | 811 | 2.7% |
| K | 743 | 2.5% |
| X | 578 | 1.9% |
| I | 432 | 1.5% |
| Other values (16) | 1987 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29784 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 10251 | |
| S | 10220 | |
| E | 1412 | 4.7% |
| C | 1380 | 4.6% |
| M | 1059 | 3.6% |
| A | 911 | 3.1% |
| G | 811 | 2.7% |
| K | 743 | 2.5% |
| X | 578 | 1.9% |
| I | 432 | 1.5% |
| Other values (16) | 1987 | 6.7% |
stateProvince
Text
Missing 
| Distinct | 260 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 5347 |
| Missing (%) | 28.3% |
| Memory size | 147.5 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 25 |
| Mean length | 11.24032843 |
| Min length | 3 |
Unique
| Unique | 72 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | Connecticut |
|---|---|
| 2nd row | Connecticut |
| 3rd row | Connecticut |
| 4th row | Connecticut |
| 5th row | Connecticut |
| Value | Count | Frequency (%) |
| new | 3586 | |
| hampshire | 2877 | 13.8% |
| connecticut | 1497 | 7.2% |
| province | 1288 | 6.2% |
| state | 613 | 2.9% |
| minnesota | 580 | 2.8% |
| york | 506 | 2.4% |
| colorado | 463 | 2.2% |
| arizona | 438 | 2.1% |
| wisconsin | 425 | 2.0% |
| Other values (287) | 8636 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14301 | 9.4% |
| a | 13747 | 9.0% |
| i | 12892 | 8.5% |
| n | 10792 | 7.1% |
| o | 10626 | 7.0% |
| r | 9355 | 6.2% |
| t | 8029 | 5.3% |
| s | 7835 | 5.2% |
| (space) | 7390 | 4.9% |
| c | 5643 | 3.7% |
| Other values (48) | 51348 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 123691 | |
| Uppercase Letter | 20861 | 13.7% |
| Space Separator | 7390 | 4.9% |
| Dash Punctuation | 15 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14301 | |
| a | 13747 | |
| i | 12892 | |
| n | 10792 | |
| o | 10626 | |
| r | 9355 | 7.6% |
| t | 8029 | 6.5% |
| s | 7835 | 6.3% |
| c | 5643 | 4.6% |
| h | 4465 | 3.6% |
| Other values (20) | 26006 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4092 | |
| C | 3272 | |
| H | 2932 | |
| P | 1777 | |
| M | 1528 | 7.3% |
| A | 1144 | 5.5% |
| S | 846 | 4.1% |
| W | 788 | 3.8% |
| V | 591 | 2.8% |
| Y | 507 | 2.4% |
| Other values (15) | 3384 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7390 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 144552 | |
| Common | 7406 | 4.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14301 | 9.9% |
| a | 13747 | 9.5% |
| i | 12892 | 8.9% |
| n | 10792 | 7.5% |
| o | 10626 | 7.4% |
| r | 9355 | 6.5% |
| t | 8029 | 5.6% |
| s | 7835 | 5.4% |
| c | 5643 | 3.9% |
| h | 4465 | 3.1% |
| Other values (45) | 46867 |
Common
| Value | Count | Frequency (%) |
| (space) | 7390 | |
| - | 15 | 0.2% |
| ' | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 151865 | |
| None | 93 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 14301 | 9.4% |
| a | 13747 | 9.1% |
| i | 12892 | 8.5% |
| n | 10792 | 7.1% |
| o | 10626 | 7.0% |
| r | 9355 | 6.2% |
| t | 8029 | 5.3% |
| s | 7835 | 5.2% |
| (space) | 7390 | 4.9% |
| c | 5643 | 3.7% |
| Other values (44) | 51255 |
None
| Value | Count | Frequency (%) |
| á | 72 | |
| í | 16 | 17.2% |
| ó | 3 | 3.2% |
| é | 2 | 2.2% |
county
Text
Missing 
| Distinct | 484 |
|---|---|
| Distinct (%) | 5.0% |
| Missing | 9192 |
| Missing (%) | 48.7% |
| Memory size | 147.5 KiB |
Length
| Max length | 28 |
|---|---|
| Median length | 27 |
| Mean length | 14.43198263 |
| Min length | 6 |
Unique
| Unique | 154 |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | New Haven County |
|---|---|
| 2nd row | Middlesex County |
| 3rd row | Middlesex County |
| 4th row | Middlesex County |
| 5th row | Middlesex County |
| Value | Count | Frequency (%) |
| county | 9433 | |
| carroll | 2750 | 13.3% |
| new | 705 | 3.4% |
| haven | 655 | 3.2% |
| cass | 356 | 1.7% |
| litchfield | 334 | 1.6% |
| gunnison | 275 | 1.3% |
| fairfield | 220 | 1.1% |
| iron | 203 | 1.0% |
| middlesex | 167 | 0.8% |
| Other values (517) | 5606 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 15785 | |
| n | 14029 | |
| C | 13107 | |
| t | 11232 | 8.0% |
| (space) | 11030 | 7.9% |
| u | 10798 | 7.7% |
| y | 9773 | 7.0% |
| r | 8601 | 6.2% |
| l | 7999 | 5.7% |
| a | 7541 | 5.4% |
| Other values (47) | 29720 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 107635 | |
| Uppercase Letter | 20854 | 14.9% |
| Space Separator | 11030 | 7.9% |
| Other Punctuation | 55 | < 0.1% |
| Dash Punctuation | 41 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 15785 | |
| n | 14029 | |
| t | 11232 | |
| u | 10798 | |
| y | 9773 | |
| r | 8601 | |
| l | 7999 | |
| a | 7541 | |
| e | 5545 | 5.2% |
| i | 3786 | 3.5% |
| Other values (18) | 12546 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 13107 | |
| H | 924 | 4.4% |
| L | 867 | 4.2% |
| N | 832 | 4.0% |
| S | 670 | 3.2% |
| M | 609 | 2.9% |
| F | 535 | 2.6% |
| G | 503 | 2.4% |
| P | 490 | 2.3% |
| B | 456 | 2.2% |
| Other values (15) | 1861 | 8.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 28 | |
| ' | 27 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 11030 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 128489 | |
| Common | 11126 | 8.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 15785 | |
| n | 14029 | |
| C | 13107 | |
| t | 11232 | |
| u | 10798 | |
| y | 9773 | |
| r | 8601 | 6.7% |
| l | 7999 | 6.2% |
| a | 7541 | 5.9% |
| e | 5545 | 4.3% |
| Other values (43) | 24079 |
Common
| Value | Count | Frequency (%) |
| (space) | 11030 | |
| - | 41 | 0.4% |
| . | 28 | 0.3% |
| ' | 27 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 139604 | |
| None | 11 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 15785 | |
| n | 14029 | |
| C | 13107 | |
| t | 11232 | 8.0% |
| (space) | 11030 | 7.9% |
| u | 10798 | 7.7% |
| y | 9773 | 7.0% |
| r | 8601 | 6.2% |
| l | 7999 | 5.7% |
| a | 7541 | 5.4% |
| Other values (44) | 29709 |
None
| Value | Count | Frequency (%) |
| é | 9 | |
| Á | 1 | 9.1% |
| ó | 1 | 9.1% |
municipality
Text
Missing 
| Distinct | 93 |
|---|---|
| Distinct (%) | 16.7% |
| Missing | 18309 |
| Missing (%) | 97.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 25 |
|---|---|
| Median length | 19 |
| Mean length | 8.47935368 |
| Min length | 4 |
Unique
| Unique | 37 |
|---|---|
| Unique (%) | 6.6% |
Sample
| 1st row | Redding |
|---|---|
| 2nd row | Hamden |
| 3rd row | Hamden |
| 4th row | Perkasie |
| 5th row | Philadelphia |
| Value | Count | Frequency (%) |
| parksville | 56 | 8.5% |
| fairfield | 39 | 5.9% |
| westport | 35 | 5.3% |
| kent | 32 | 4.9% |
| norwalk | 29 | 4.4% |
| lloyd | 27 | 4.1% |
| harbor | 27 | 4.1% |
| new | 25 | 3.8% |
| quince | 24 | 3.6% |
| mil | 24 | 3.6% |
| Other values (98) | 340 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 421 | 8.9% |
| e | 410 | 8.7% |
| a | 396 | 8.4% |
| r | 359 | 7.6% |
| i | 356 | 7.5% |
| o | 282 | 6.0% |
| n | 258 | 5.5% |
| t | 205 | 4.3% |
| s | 184 | 3.9% |
| d | 163 | 3.5% |
| Other values (39) | 1689 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3961 | |
| Uppercase Letter | 658 | 13.9% |
| Space Separator | 101 | 2.1% |
| Other Punctuation | 2 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 421 | |
| e | 410 | |
| a | 396 | |
| r | 359 | 9.1% |
| i | 356 | 9.0% |
| o | 282 | 7.1% |
| n | 258 | 6.5% |
| t | 205 | 5.2% |
| s | 184 | 4.6% |
| d | 163 | 4.1% |
| Other values (13) | 927 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 108 | |
| N | 59 | |
| W | 58 | 8.8% |
| M | 55 | 8.4% |
| H | 49 | 7.4% |
| F | 43 | 6.5% |
| L | 41 | 6.2% |
| K | 36 | 5.5% |
| B | 35 | 5.3% |
| Q | 28 | 4.3% |
| Other values (12) | 146 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1 | |
| & | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 101 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4619 | |
| Common | 104 | 2.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 421 | 9.1% |
| e | 410 | 8.9% |
| a | 396 | 8.6% |
| r | 359 | 7.8% |
| i | 356 | 7.7% |
| o | 282 | 6.1% |
| n | 258 | 5.6% |
| t | 205 | 4.4% |
| s | 184 | 4.0% |
| d | 163 | 3.5% |
| Other values (35) | 1585 |
Common
| Value | Count | Frequency (%) |
| (space) | 101 | |
| , | 1 | 1.0% |
| - | 1 | 1.0% |
| & | 1 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4723 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 421 | 8.9% |
| e | 410 | 8.7% |
| a | 396 | 8.4% |
| r | 359 | 7.6% |
| i | 356 | 7.5% |
| o | 282 | 6.0% |
| n | 258 | 5.5% |
| t | 205 | 4.3% |
| s | 184 | 3.9% |
| d | 163 | 3.5% |
| Other values (39) | 1689 |
locality
Text
Missing 
| Distinct | 2520 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 5869 |
| Missing (%) | 31.1% |
| Memory size | 147.5 KiB |
Length
| Max length | 136 |
|---|---|
| Median length | 96 |
| Mean length | 26.34000154 |
| Min length | 3 |
Unique
| Unique | 1275 |
|---|---|
| Unique (%) | 9.8% |
Sample
| 1st row | New Haven. Yale University, Peabody Museum |
|---|---|
| 2nd row | Clinton. 245 Killingworth Turnpike |
| 3rd row | Clinton. 245 Killingworth Turnpike |
| 4th row | Clinton. 245 Killingworth Turnpike |
| 5th row | Clinton. 245 Killingworth Turnpike |
| Value | Count | Frequency (%) |
| forest | 3560 | 6.6% |
| experimental | 2766 | 5.1% |
| bartlett | 2744 | 5.1% |
| of | 2288 | 4.2% |
| comp | 1856 | 3.4% |
| miles | 929 | 1.7% |
| transect | 736 | 1.4% |
| mi | 727 | 1.3% |
| national | 657 | 1.2% |
| island | 533 | 1.0% |
| Other values (3204) | 37538 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 41355 | 12.1% |
| e | 29260 | 8.5% |
| a | 26473 | 7.7% |
| t | 25066 | 7.3% |
| o | 21028 | 6.1% |
| r | 19414 | 5.7% |
| n | 17045 | 5.0% |
| i | 15970 | 4.7% |
| l | 15843 | 4.6% |
| s | 12163 | 3.6% |
| Other values (76) | 118724 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 240976 | |
| Space Separator | 41355 | 12.1% |
| Uppercase Letter | 38660 | 11.3% |
| Decimal Number | 10790 | 3.2% |
| Other Punctuation | 9066 | 2.6% |
| Dash Punctuation | 863 | 0.3% |
| Open Punctuation | 261 | 0.1% |
| Close Punctuation | 261 | 0.1% |
| Math Symbol | 109 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 29260 | |
| a | 26473 | |
| t | 25066 | |
| o | 21028 | |
| r | 19414 | |
| n | 17045 | 7.1% |
| i | 15970 | 6.6% |
| l | 15843 | 6.6% |
| s | 12163 | 5.0% |
| m | 10319 | 4.3% |
| Other values (20) | 48395 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 4122 | |
| C | 4107 | |
| B | 4072 | |
| E | 3542 | 9.2% |
| S | 2650 | 6.9% |
| M | 2630 | 6.8% |
| N | 2411 | 6.2% |
| R | 2164 | 5.6% |
| P | 1557 | 4.0% |
| A | 1375 | 3.6% |
| Other values (16) | 10030 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6402 | |
| , | 1832 | 20.2% |
| / | 501 | 5.5% |
| ' | 120 | 1.3% |
| ? | 66 | 0.7% |
| ; | 48 | 0.5% |
| " | 40 | 0.4% |
| & | 31 | 0.3% |
| : | 22 | 0.2% |
| # | 4 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2029 | |
| 5 | 1908 | |
| 4 | 1565 | |
| 0 | 1214 | |
| 3 | 942 | |
| 6 | 915 | |
| 2 | 913 | |
| 7 | 511 | 4.7% |
| 8 | 475 | 4.4% |
| 9 | 318 | 2.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 212 | |
| ) | 48 | 18.4% |
| } | 1 | 0.4% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 105 | |
| ~ | 2 | 1.8% |
| + | 2 | 1.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 213 | |
| ( | 48 | 18.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 41355 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 863 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 279636 | |
| Common | 62705 | 18.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 29260 | 10.5% |
| a | 26473 | 9.5% |
| t | 25066 | 9.0% |
| o | 21028 | 7.5% |
| r | 19414 | 6.9% |
| n | 17045 | 6.1% |
| i | 15970 | 5.7% |
| l | 15843 | 5.7% |
| s | 12163 | 4.3% |
| m | 10319 | 3.7% |
| Other values (46) | 87055 |
Common
| Value | Count | Frequency (%) |
| (space) | 41355 | |
| . | 6402 | 10.2% |
| 1 | 2029 | 3.2% |
| 5 | 1908 | 3.0% |
| , | 1832 | 2.9% |
| 4 | 1565 | 2.5% |
| 0 | 1214 | 1.9% |
| 3 | 942 | 1.5% |
| 6 | 915 | 1.5% |
| 2 | 913 | 1.5% |
| Other values (20) | 3630 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 342324 | |
| None | 17 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 41355 | 12.1% |
| e | 29260 | 8.5% |
| a | 26473 | 7.7% |
| t | 25066 | 7.3% |
| o | 21028 | 6.1% |
| r | 19414 | 5.7% |
| n | 17045 | 5.0% |
| i | 15970 | 4.7% |
| l | 15843 | 4.6% |
| s | 12163 | 3.6% |
| Other values (72) | 118707 |
None
| Value | Count | Frequency (%) |
| í | 8 | |
| ç | 4 | |
| ö | 4 | |
| á | 1 | 5.9% |
Missing 
| Distinct | 195 |
|---|---|
| Distinct (%) | 13.2% |
| Missing | 17391 |
| Missing (%) | 92.2% |
| Memory size | 147.5 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 8.446779661 |
| Min length | 4 |
Unique
| Unique | 57 |
|---|---|
| Unique (%) | 3.9% |
Sample
| 1st row | 200-200 ft |
|---|---|
| 2nd row | 200-200 ft |
| 3rd row | 638 m |
| 4th row | 638 m |
| 5th row | 1143 m |
| Value | Count | Frequency (%) |
| m | 858 | |
| ft | 617 | |
| 200-200 | 104 | 3.5% |
| 1829 | 84 | 2.8% |
| 700 | 58 | 2.0% |
| 638 | 56 | 1.9% |
| 2134 | 56 | 1.9% |
| 6000-6000 | 40 | 1.4% |
| 500 | 39 | 1.3% |
| 2896 | 33 | 1.1% |
| Other values (172) | 1005 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3601 | |
| (space) | 1475 | |
| m | 858 | 6.9% |
| - | 784 | 6.3% |
| 2 | 739 | 5.9% |
| 1 | 683 | 5.5% |
| f | 617 | 5.0% |
| t | 617 | 5.0% |
| 8 | 490 | 3.9% |
| 4 | 490 | 3.9% |
| Other values (5) | 2105 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8108 | |
| Lowercase Letter | 2092 | 16.8% |
| Space Separator | 1475 | 11.8% |
| Dash Punctuation | 784 | 6.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3601 | |
| 2 | 739 | 9.1% |
| 1 | 683 | 8.4% |
| 8 | 490 | 6.0% |
| 4 | 490 | 6.0% |
| 5 | 468 | 5.8% |
| 3 | 458 | 5.6% |
| 6 | 449 | 5.5% |
| 9 | 375 | 4.6% |
| 7 | 355 | 4.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 858 | |
| f | 617 | |
| t | 617 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1475 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 784 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10367 | |
| Latin | 2092 | 16.8% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3601 | |
| (space) | 1475 | |
| - | 784 | 7.6% |
| 2 | 739 | 7.1% |
| 1 | 683 | 6.6% |
| 8 | 490 | 4.7% |
| 4 | 490 | 4.7% |
| 5 | 468 | 4.5% |
| 3 | 458 | 4.4% |
| 6 | 449 | 4.3% |
| Other values (2) | 730 | 7.0% |
Latin
| Value | Count | Frequency (%) |
| m | 858 | |
| f | 617 | |
| t | 617 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12459 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3601 | |
| (space) | 1475 | |
| m | 858 | 6.9% |
| - | 784 | 6.3% |
| 2 | 739 | 5.9% |
| 1 | 683 | 5.5% |
| f | 617 | 5.0% |
| t | 617 | 5.0% |
| 8 | 490 | 3.9% |
| 4 | 490 | 3.9% |
| Other values (5) | 2105 |
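The sample rows for this field mix units and formats ("200-200 ft", "638 m"), so the values cannot be compared numerically as they stand. The following is a minimal sketch, assuming every value is either a single number or a `low-high` range followed by `m` or `ft`; the `parse_elevation` helper and its regular expression are hypothetical and would need checking against the full set of 195 distinct values.

```python
import re

FEET_TO_METERS = 0.3048

# Assumed layout: "<number> <unit>" or "<low>-<high> <unit>", unit is m or ft.
_ELEVATION = re.compile(
    r"^\s*(\d+(?:\.\d+)?)(?:\s*-\s*(\d+(?:\.\d+)?))?\s*(m|ft)\s*$"
)


def parse_elevation(text):
    """Return (low, high) in meters, or None if the text does not match."""
    match = _ELEVATION.match(text)
    if match is None:
        return None
    low = float(match.group(1))
    # A single value is treated as a zero-width range.
    high = float(match.group(2)) if match.group(2) else low
    factor = FEET_TO_METERS if match.group(3) == "ft" else 1.0
    return (low * factor, high * factor)
```

Returning `None` for anything that does not match keeps unexpected formats visible instead of silently coercing them.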
decimalLatitude
Text
Missing 
| Distinct | 2246 |
|---|---|
| Distinct (%) | 16.9% |
| Missing | 5543 |
| Missing (%) | 29.4% |
| Memory size | 147.5 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 7.516775501 |
| Min length | 3 |
Unique
| Unique | 1060 |
|---|---|
| Unique (%) | 8.0% |
Sample
| 1st row | 41.358889 |
|---|---|
| 2nd row | 41.358889 |
| 3rd row | 40.280472 |
| 4th row | 39.966055 |
| 5th row | 40.280472 |
| Value | Count | Frequency (%) |
| 44.049466 | 311 | 2.3% |
| 44.059277 | 252 | 1.9% |
| 44.062155 | 245 | 1.8% |
| 3.9167 | 244 | 1.8% |
| 44.05088 | 232 | 1.7% |
| 44.061185 | 228 | 1.7% |
| 44.041766 | 222 | 1.7% |
| 44.059944 | 204 | 1.5% |
| 41.3931 | 147 | 1.1% |
| 41.3081 | 130 | 1.0% |
| Other values (2207) | 11108 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 15803 | |
| . | 13323 | |
| 3 | 11104 | |
| 1 | 8831 | |
| 6 | 8363 | |
| 5 | 7455 | |
| 0 | 7343 | |
| 7 | 6908 | |
| 2 | 6851 | |
| 8 | 6767 | |
| Other values (2) | 7398 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 85426 | |
| Other Punctuation | 13323 | 13.3% |
| Dash Punctuation | 1397 | 1.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 15803 | |
| 3 | 11104 | |
| 1 | 8831 | |
| 6 | 8363 | |
| 5 | 7455 | |
| 0 | 7343 | |
| 7 | 6908 | |
| 2 | 6851 | |
| 8 | 6767 | |
| 9 | 6001 | 7.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13323 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1397 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 100146 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 15803 | |
| . | 13323 | |
| 3 | 11104 | |
| 1 | 8831 | |
| 6 | 8363 | |
| 5 | 7455 | |
| 0 | 7343 | |
| 7 | 6908 | |
| 2 | 6851 | |
| 8 | 6767 | |
| Other values (2) | 7398 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 100146 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 15803 | |
| . | 13323 | |
| 3 | 11104 | |
| 1 | 8831 | |
| 6 | 8363 | |
| 5 | 7455 | |
| 0 | 7343 | |
| 7 | 6908 | |
| 2 | 6851 | |
| 8 | 6767 | |
| Other values (2) | 7398 |
decimalLongitude
Text
Missing 
| Distinct | 2284 |
|---|---|
| Distinct (%) | 17.1% |
| Missing | 5543 |
| Missing (%) | 29.4% |
| Memory size | 147.5 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 8.606995421 |
| Min length | 3 |
Unique
| Unique | 1091 |
|---|---|
| Unique (%) | 8.2% |
Sample
| 1st row | -72.903807 |
|---|---|
| 2nd row | -72.903807 |
| 3rd row | -75.050684 |
| 4th row | -75.195683 |
| 5th row | -75.050684 |
| Value | Count | Frequency (%) |
| 71.27383 | 311 | 2.3% |
| 71.304611 | 252 | 1.9% |
| 71.297795 | 245 | 1.8% |
| 136.1667 | 244 | 1.8% |
| 71.307927 | 232 | 1.7% |
| 71.303074 | 228 | 1.7% |
| 71.319924 | 222 | 1.7% |
| 71.308122 | 204 | 1.5% |
| 71.290348 | 160 | 1.2% |
| 72.8972 | 148 | 1.1% |
| Other values (2267) | 11077 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 14318 | |
| 7 | 13516 | |
| . | 13323 | |
| 3 | 11679 | |
| - | 11234 | |
| 2 | 8832 | |
| 6 | 7982 | |
| 9 | 7758 | |
| 0 | 7552 | |
| 8 | 7489 | |
| Other values (2) | 10988 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 90114 | |
| Other Punctuation | 13323 | 11.6% |
| Dash Punctuation | 11234 | 9.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 14318 | |
| 7 | 13516 | |
| 3 | 11679 | |
| 2 | 8832 | |
| 6 | 7982 | |
| 9 | 7758 | |
| 0 | 7552 | |
| 8 | 7489 | |
| 5 | 5508 | 6.1% |
| 4 | 5480 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13323 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11234 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 114671 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 14318 | |
| 7 | 13516 | |
| . | 13323 | |
| 3 | 11679 | |
| - | 11234 | |
| 2 | 8832 | |
| 6 | 7982 | |
| 9 | 7758 | |
| 0 | 7552 | |
| 8 | 7489 | |
| Other values (2) | 10988 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 114671 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 14318 | |
| 7 | 13516 | |
| . | 13323 | |
| 3 | 11679 | |
| - | 11234 | |
| 2 | 8832 | |
| 6 | 7982 | |
| 9 | 7758 | |
| 0 | 7552 | |
| 8 | 7489 | |
| Other values (2) | 10988 |
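Both coordinate fields are profiled as Text, so any numeric use needs an explicit conversion first. Here is a minimal sketch, assuming plain decimal-degree strings such as the sample rows above; `parse_coordinate` is a hypothetical helper, and the limits of 90 and 180 degrees are the usual bounds for latitude and longitude.

```python
def parse_coordinate(text, limit):
    """Convert a decimal-degree string to float, or None if invalid.

    `limit` is the largest allowed absolute value: 90 for latitude,
    180 for longitude.
    """
    try:
        value = float(text)
    except (TypeError, ValueError):
        return None
    # NaN fails this comparison and is therefore rejected as well.
    return value if -limit <= value <= limit else None


latitude = parse_coordinate("41.358889", 90)
longitude = parse_coordinate("-72.903807", 180)
```

A value such as "136.1667" passes the longitude check but is rejected as a latitude, which gives a quick guard against swapped columns.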
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 475 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 5609 |
| Missing (%) | 29.7% |
| Memory size | 147.5 KiB |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 6.103341631 |
| Min length | 4 |
Unique
| Unique | 227 |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | 5359.0 |
|---|---|
| 2nd row | 5359.0 |
| 3rd row | 5359.0 |
| 4th row | 5359.0 |
| 5th row | 5359.0 |
| Value | Count | Frequency (%) |
| 1850.0 | 5476 | |
| 1851.0 | 4930 | |
| 111111.0 | 329 | 2.5% |
| 3036.0 | 110 | 0.8% |
| 1583.0 | 104 | 0.8% |
| 301.0 | 97 | 0.7% |
| 103733.0 | 86 | 0.6% |
| 5000.0 | 84 | 0.6% |
| 300.0 | 79 | 0.6% |
| 500.0 | 66 | 0.5% |
| Other values (465) | 1896 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 20535 | |
| 1 | 19011 | |
| . | 13257 | |
| 5 | 11398 | |
| 8 | 11362 | |
| 3 | 1449 | 1.8% |
| 4 | 978 | 1.2% |
| 7 | 822 | 1.0% |
| 6 | 746 | 0.9% |
| 2 | 681 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 67655 | |
| Other Punctuation | 13257 | 16.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 20535 | |
| 1 | 19011 | |
| 5 | 11398 | |
| 8 | 11362 | |
| 3 | 1449 | 2.1% |
| 4 | 978 | 1.4% |
| 7 | 822 | 1.2% |
| 6 | 746 | 1.1% |
| 2 | 681 | 1.0% |
| 9 | 673 | 1.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13257 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 80912 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 20535 | |
| 1 | 19011 | |
| . | 13257 | |
| 5 | 11398 | |
| 8 | 11362 | |
| 3 | 1449 | 1.8% |
| 4 | 978 | 1.2% |
| 7 | 822 | 1.0% |
| 6 | 746 | 0.9% |
| 2 | 681 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 80912 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 20535 | |
| 1 | 19011 | |
| . | 13257 | |
| 5 | 11398 | |
| 8 | 11362 | |
| 3 | 1449 | 1.8% |
| 4 | 978 | 1.2% |
| 7 | 822 | 1.0% |
| 6 | 746 | 0.9% |
| 2 | 681 | 0.8% |
georeferencedBy
Text
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 18537 |
| Missing (%) | 98.3% |
| Memory size | 147.5 KiB |
Length
| Max length | 26 |
|---|---|
| Median length | 17 |
| Mean length | 17.73860182 |
| Min length | 13 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | Piper L. Stepule |
|---|---|
| 2nd row | Piper L. Stepule |
| 3rd row | Peter A. Capainolo |
| 4th row | Kristof Zyskowski |
| 5th row | Nicholas J. Kerhoulas |
| Value | Count | Frequency (%) |
| kristof | 233 | |
| zyskowski | 233 | |
| j | 37 | 5.0% |
| gregory | 24 | 3.3% |
| watkins-colwell | 24 | 3.3% |
| peter | 22 | 3.0% |
| a | 22 | 3.0% |
| capainolo | 22 | 3.0% |
| dornburg | 14 | 1.9% |
| alex | 14 | 1.9% |
| Other values (26) | 93 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 761 | |
| o | 607 | 10.4% |
| i | 545 | 9.3% |
| k | 497 | 8.5% |
| (space) | 409 | 7.0% |
| r | 364 | 6.2% |
| t | 294 | 5.0% |
| y | 269 | 4.6% |
| w | 263 | 4.5% |
| K | 251 | 4.3% |
| Other values (32) | 1576 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4560 | |
| Uppercase Letter | 763 | 13.1% |
| Space Separator | 409 | 7.0% |
| Other Punctuation | 80 | 1.4% |
| Dash Punctuation | 24 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 761 | |
| o | 607 | |
| i | 545 | |
| k | 497 | |
| r | 364 | |
| t | 294 | 6.4% |
| y | 269 | 5.9% |
| w | 263 | 5.8% |
| f | 234 | 5.1% |
| e | 163 | 3.6% |
| Other values (13) | 563 |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 251 | |
| Z | 233 | |
| C | 49 | 6.4% |
| J | 43 | 5.6% |
| A | 39 | 5.1% |
| P | 33 | 4.3% |
| W | 30 | 3.9% |
| G | 24 | 3.1% |
| D | 17 | 2.2% |
| S | 13 | 1.7% |
| Other values (6) | 31 | 4.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 409 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 80 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 24 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5323 | |
| Common | 513 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 761 | |
| o | 607 | |
| i | 545 | |
| k | 497 | |
| r | 364 | 6.8% |
| t | 294 | 5.5% |
| y | 269 | 5.1% |
| w | 263 | 4.9% |
| K | 251 | 4.7% |
| f | 234 | 4.4% |
| Other values (29) | 1238 |
Common
| Value | Count | Frequency (%) |
| (space) | 409 | |
| . | 80 | 15.6% |
| - | 24 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5836 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 761 | |
| o | 607 | 10.4% |
| i | 545 | 9.3% |
| k | 497 | 8.5% |
| (space) | 409 | 7.0% |
| r | 364 | 6.2% |
| t | 294 | 5.0% |
| y | 269 | 4.6% |
| w | 263 | 4.5% |
| K | 251 | 4.3% |
| Other values (32) | 1576 |
Missing 
| Distinct | 48 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 10549 |
| Missing (%) | 55.9% |
| Memory size | 147.5 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.131417578 |
| Min length | 4 |
Unique
| Unique | 16 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
| Value | Count | Frequency (%) |
| 2023-12-28 | 5807 | |
| 2015 | 1204 | 14.5% |
| 2020-06-14 | 935 | 11.2% |
| 2020-12-30 | 124 | 1.5% |
| 2023-12-03 | 45 | 0.5% |
| 2021-12-08 | 27 | 0.3% |
| 2024-01-17 | 18 | 0.2% |
| 2024-05-01 | 17 | 0.2% |
| 2019-11-04 | 16 | 0.2% |
| 2022-06-18 | 14 | 0.2% |
| Other values (38) | 110 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 27323 | |
| - | 14226 | |
| 0 | 10723 | 14.1% |
| 1 | 8422 | 11.1% |
| 3 | 6079 | 8.0% |
| 8 | 5869 | 7.7% |
| 5 | 1236 | 1.6% |
| 4 | 1028 | 1.4% |
| 6 | 994 | 1.3% |
| 7 | 24 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 61720 | |
| Dash Punctuation | 14226 | 18.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 27323 | |
| 0 | 10723 | 17.4% |
| 1 | 8422 | 13.6% |
| 3 | 6079 | 9.8% |
| 8 | 5869 | 9.5% |
| 5 | 1236 | 2.0% |
| 4 | 1028 | 1.7% |
| 6 | 994 | 1.6% |
| 7 | 24 | < 0.1% |
| 9 | 22 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14226 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 75946 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 27323 | |
| - | 14226 | |
| 0 | 10723 | 14.1% |
| 1 | 8422 | 11.1% |
| 3 | 6079 | 8.0% |
| 8 | 5869 | 7.7% |
| 5 | 1236 | 1.6% |
| 4 | 1028 | 1.4% |
| 6 | 994 | 1.3% |
| 7 | 24 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 75946 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 27323 | |
| - | 14226 | |
| 0 | 10723 | 14.1% |
| 1 | 8422 | 11.1% |
| 3 | 6079 | 8.0% |
| 8 | 5869 | 7.7% |
| 5 | 1236 | 1.6% |
| 4 | 1028 | 1.4% |
| 6 | 994 | 1.3% |
| 7 | 24 | < 0.1% |
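The values listed for this field mix year-only entries ("2015") with full ISO dates ("2023-12-28"), which matches the minimum length of 4 and maximum length of 10. The following is a minimal sketch, assuming those are the only two layouts present; `parse_partial_date` is a hypothetical helper that returns a `(year, month, day)` tuple with `None` for the parts that are not recorded.

```python
from datetime import date


def parse_partial_date(text):
    """Parse "YYYY" or "YYYY-MM-DD" into (year, month, day); None if invalid."""
    text = text.strip()
    # Year-only values carry no month or day information.
    if len(text) == 4 and text.isdigit():
        return (int(text), None, None)
    try:
        parsed = date.fromisoformat(text)
    except ValueError:
        return None
    return (parsed.year, parsed.month, parsed.day)
```

Keeping the missing parts as `None` avoids inventing a 1 January date for the 1204 year-only records.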
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5610 |
| Missing (%) | 29.7% |
| Memory size | 147.5 KiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 13.75980688 |
| Min length | 11 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | digital resource |
|---|---|
| 2nd row | digital resource |
| 3rd row | digital resource |
| 4th row | digital resource |
| 5th row | digital resource |
| Value | Count | Frequency (%) |
| resource | 7300 | |
| digital | 7216 | |
| unspecified | 5956 | |
| physical | 84 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 26512 | |
| i | 26428 | |
| r | 14600 | 8.0% |
| s | 13340 | 7.3% |
| c | 13340 | 7.3% |
| u | 13256 | 7.3% |
| d | 13172 | 7.2% |
| (space) | 7300 | 4.0% |
| l | 7300 | 4.0% |
| a | 7300 | 4.0% |
| Other values (8) | 39852 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 175100 | |
| Space Separator | 7300 | 4.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 26512 | |
| i | 26428 | |
| r | 14600 | |
| s | 13340 | |
| c | 13340 | |
| u | 13256 | |
| d | 13172 | |
| l | 7300 | 4.2% |
| a | 7300 | 4.2% |
| o | 7300 | 4.2% |
| Other values (7) | 32552 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7300 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 175100 | |
| Common | 7300 | 4.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 26512 | |
| i | 26428 | |
| r | 14600 | |
| s | 13340 | |
| c | 13340 | |
| u | 13256 | |
| d | 13172 | |
| l | 7300 | 4.2% |
| a | 7300 | 4.2% |
| o | 7300 | 4.2% |
| Other values (7) | 32552 |
Common
| Value | Count | Frequency (%) |
| (space) | 7300 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 182400 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 26512 | |
| i | 26428 | |
| r | 14600 | 8.0% |
| s | 13340 | 7.3% |
| c | 13340 | 7.3% |
| u | 13256 | 7.3% |
| d | 13172 | 7.2% |
| (space) | 7300 | 4.0% |
| l | 7300 | 4.0% |
| a | 7300 | 4.0% |
| Other values (8) | 39852 |
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 5615 |
| Missing (%) | 29.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 21 |
|---|---|
| Median length | 15 |
| Mean length | 9.898347295 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NEVP |
|---|---|
| 2nd row | NEVP |
| 3rd row | NEVP |
| 4th row | NEVP |
| 5th row | NEVP |
| Value | Count | Frequency (%) |
| unspecified | 5957 | |
| unit | 3838 | |
| gps | 3838 | |
| geolocate | 1254 | 6.7% |
| | 785 | 4.2% |
| earth | 713 | 3.8% |
| vertnet | 649 | 3.5% |
| 2014 | 290 | 1.5% |
| census | 290 | 1.5% |
| tiger | 290 | 1.5% |
| Other values (11) | 847 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 16291 | |
| e | 15708 | |
| n | 10145 | 7.7% |
| u | 10138 | 7.7% |
| c | 7413 | 5.7% |
| t | 7162 | 5.5% |
| s | 6614 | 5.0% |
| p | 6243 | 4.8% |
| G | 6167 | 4.7% |
| d | 6099 | 4.6% |
| Other values (32) | 39183 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 101255 | |
| Uppercase Letter | 23104 | 17.6% |
| Space Separator | 5500 | 4.2% |
| Decimal Number | 1160 | 0.9% |
| Other Punctuation | 144 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 16291 | |
| e | 15708 | |
| n | 10145 | |
| u | 10138 | |
| c | 7413 | |
| t | 7162 | |
| s | 6614 | |
| p | 6243 | 6.2% |
| d | 6099 | 6.0% |
| f | 5957 | 5.9% |
| Other values (10) | 9485 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 6167 | |
| S | 4128 | |
| P | 4106 | |
| E | 2531 | |
| L | 1254 | 5.4% |
| O | 1254 | 5.4% |
| N | 923 | 4.0% |
| V | 917 | 4.0% |
| T | 296 | 1.3% |
| C | 290 | 1.3% |
| Other values (6) | 1238 | 5.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 290 | |
| 1 | 290 | |
| 0 | 290 | |
| 2 | 290 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5500 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 144 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 124359 | |
| Common | 6804 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 16291 | |
| e | 15708 | |
| n | 10145 | 8.2% |
| u | 10138 | 8.2% |
| c | 7413 | 6.0% |
| t | 7162 | 5.8% |
| s | 6614 | 5.3% |
| p | 6243 | 5.0% |
| G | 6167 | 5.0% |
| d | 6099 | 4.9% |
| Other values (26) | 32379 |
Common
| Value | Count | Frequency (%) |
| (space) | 5500 | |
| 4 | 290 | 4.3% |
| 1 | 290 | 4.3% |
| 0 | 290 | 4.3% |
| 2 | 290 | 4.3% |
| . | 144 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 131163 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 16291 | |
| e | 15708 | |
| n | 10145 | 7.7% |
| u | 10138 | 7.7% |
| c | 7413 | 5.7% |
| t | 7162 | 5.5% |
| s | 6614 | 5.0% |
| p | 6243 | 4.8% |
| G | 6167 | 4.7% |
| d | 6099 | 4.6% |
| Other values (32) | 39183 |
Missing 
| Distinct | 562 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 5661 |
| Missing (%) | 30.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 570 |
|---|---|
| Median length | 446 |
| Mean length | 102.1251799 |
| Min length | 8 |
Unique
| Unique | 291 |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | provisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG |
|---|---|
| 2nd row | provisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG |
| 3rd row | provisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG |
| 4th row | provisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG |
| 5th row | provisional georeference to ST CO PL: Connecticut Middlesex Clinton, 6 Mar 2015, LFG |
| Value | Count | Frequency (%) |
| for | 11797 | 5.4% |
| km | 11604 | 5.3% |
| radius | 10782 | 5.0% |
| georeference | 7659 | 3.5% |
| to | 6875 | 3.2% |
| by | 5881 | 2.7% |
| was | 5876 | 2.7% |
| that | 5847 | 2.7% |
| only | 5832 | 2.7% |
| ex | 5813 | 2.7% |
| Other values (1631) | 139388 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 204184 | |
| e | 140668 | 10.4% |
| r | 102754 | 7.6% |
| i | 71247 | 5.3% |
| o | 67231 | 5.0% |
| s | 56371 | 4.2% |
| a | 55673 | 4.1% |
| n | 55507 | 4.1% |
| t | 49761 | 3.7% |
| d | 47386 | 3.5% |
| Other values (74) | 497781 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 936316 | |
| Space Separator | 204186 | 15.1% |
| Decimal Number | 112539 | 8.3% |
| Uppercase Letter | 62764 | 4.7% |
| Other Punctuation | 22473 | 1.7% |
| Dash Punctuation | 9955 | 0.7% |
| Open Punctuation | 131 | < 0.1% |
| Close Punctuation | 131 | < 0.1% |
| Math Symbol | 66 | < 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 140668 | |
| r | 102754 | |
| i | 71247 | 7.6% |
| o | 67231 | 7.2% |
| s | 56371 | 6.0% |
| a | 55673 | 5.9% |
| n | 55507 | 5.9% |
| t | 49761 | 5.3% |
| d | 47386 | 5.1% |
| c | 38921 | 4.2% |
| Other values (16) | 250797 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 12725 | |
| F | 7867 | |
| M | 7849 | |
| A | 7349 | |
| D | 5962 | |
| G | 2949 | 4.7% |
| C | 2406 | 3.8% |
| L | 2224 | 3.5% |
| O | 2065 | 3.3% |
| N | 1923 | 3.1% |
| Other values (16) | 9445 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 37695 | |
| 0 | 27546 | |
| 2 | 17351 | |
| 9 | 12223 | 10.9% |
| 4 | 11178 | 9.9% |
| 5 | 2315 | 2.1% |
| 6 | 1714 | 1.5% |
| 8 | 1016 | 0.9% |
| 3 | 924 | 0.8% |
| 7 | 577 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 8935 | |
| . | 6543 | |
| : | 3957 | |
| / | 2222 | 9.9% |
| ; | 391 | 1.7% |
| ' | 293 | 1.3% |
| " | 60 | 0.3% |
| & | 48 | 0.2% |
| ? | 16 | 0.1% |
| % | 8 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 43 | |
| + | 20 | |
| ~ | 3 | 4.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 204184 | |
| (non-ASCII space) | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 118 | |
| [ | 13 | 9.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 118 | |
| ] | 13 | 9.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9955 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¤ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 999080 | |
| Common | 349483 | 25.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 140668 | |
| r | 102754 | 10.3% |
| i | 71247 | 7.1% |
| o | 67231 | 6.7% |
| s | 56371 | 5.6% |
| a | 55673 | 5.6% |
| n | 55507 | 5.6% |
| t | 49761 | 5.0% |
| d | 47386 | 4.7% |
| c | 38921 | 3.9% |
| Other values (42) | 313561 |
Common
| Value | Count | Frequency (%) |
| (space) | 204184 | |
| 1 | 37695 | 10.8% |
| 0 | 27546 | 7.9% |
| 2 | 17351 | 5.0% |
| 9 | 12223 | 3.5% |
| 4 | 11178 | 3.2% |
| - | 9955 | 2.8% |
| , | 8935 | 2.6% |
| . | 6543 | 1.9% |
| : | 3957 | 1.1% |
| Other values (22) | 9916 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1348560 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 204184 | |
| e | 140668 | 10.4% |
| r | 102754 | 7.6% |
| i | 71247 | 5.3% |
| o | 67231 | 5.0% |
| s | 56371 | 4.2% |
| a | 55673 | 4.1% |
| n | 55507 | 4.1% |
| t | 49761 | 3.7% |
| d | 47386 | 3.5% |
| Other values (72) | 497778 |
None
| Value | Count | Frequency (%) |
| (non-ASCII space) | 2 | |
| ¤ | 1 |
typeStatus
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 22.7% |
| Missing | 18844 |
| Missing (%) | 99.9% |
| Memory size | 147.5 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 8.090909091 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 9.1% |
Sample
| 1st row | HYPOTYPE |
|---|---|
| 2nd row | PARATYPE |
| 3rd row | HYPOTYPE |
| 4th row | HYPOTYPE |
| 5th row | HYPOTYPE |
| Value | Count | Frequency (%) |
| hypotype | 13 | |
| paratype | 5 | 22.7% |
| topotype | 2 | 9.1% |
| plesiotype | 1 | 4.5% |
| holotype | 1 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 43 | |
| Y | 35 | |
| T | 24 | |
| E | 23 | |
| O | 20 | |
| H | 14 | 7.9% |
| A | 10 | 5.6% |
| R | 5 | 2.8% |
| L | 2 | 1.1% |
| S | 1 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 178 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 43 | |
| Y | 35 | |
| T | 24 | |
| E | 23 | |
| O | 20 | |
| H | 14 | 7.9% |
| A | 10 | 5.6% |
| R | 5 | 2.8% |
| L | 2 | 1.1% |
| S | 1 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 178 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 43 | |
| Y | 35 | |
| T | 24 | |
| E | 23 | |
| O | 20 | |
| H | 14 | 7.9% |
| A | 10 | 5.6% |
| R | 5 | 2.8% |
| L | 2 | 1.1% |
| S | 1 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 178 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 43 | |
| Y | 35 | |
| T | 24 | |
| E | 23 | |
| O | 20 | |
| H | 14 | 7.9% |
| A | 10 | 5.6% |
| R | 5 | 2.8% |
| L | 2 | 1.1% |
| S | 1 | 0.6% |
identifiedBy
Text
Missing 
| Distinct | 46 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 17735 |
| Missing (%) | 94.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 26 |
|---|---|
| Median length | 21 |
| Mean length | 15.7020336 |
| Min length | 6 |
Unique
| Unique | 10 |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | Gary P. Aronsen |
|---|---|
| 2nd row | Gary P. Aronsen |
| 3rd row | José A. Ottenwalder |
| 4th row | Angus J. Mossman |
| 5th row | Angus J. Mossman |
| Value | Count | Frequency (%) |
| jordan | 278 | 8.9% |
| colosi | 278 | 8.9% |
| g | 278 | 8.9% |
| a | 247 | 7.9% |
| mary | 240 | 7.7% |
| turner | 240 | 7.7% |
| kristof | 101 | 3.2% |
| zyskowski | 101 | 3.2% |
| alex | 100 | 3.2% |
| dornburg | 100 | 3.2% |
| Other values (91) | 1159 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1991 | 11.2% |
| r | 1773 | 10.0% |
| o | 1434 | 8.1% |
| n | 1105 | 6.2% |
| a | 1041 | 5.9% |
| e | 976 | 5.5% |
| s | 880 | 5.0% |
| i | 864 | 4.9% |
| . | 854 | 4.8% |
| l | 730 | 4.1% |
| Other values (40) | 6111 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11749 | |
| Uppercase Letter | 3145 | 17.7% |
| Space Separator | 1991 | 11.2% |
| Other Punctuation | 854 | 4.8% |
| Dash Punctuation | 20 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 1773 | |
| o | 1434 | |
| n | 1105 | |
| a | 1041 | |
| e | 976 | |
| s | 880 | |
| i | 864 | |
| l | 730 | 6.2% |
| d | 462 | 3.9% |
| y | 415 | 3.5% |
| Other values (15) | 2069 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 467 | |
| J | 384 | |
| C | 371 | |
| M | 341 | |
| G | 309 | |
| K | 263 | |
| T | 244 | |
| D | 107 | 3.4% |
| N | 103 | 3.3% |
| Z | 101 | 3.2% |
| Other values (12) | 455 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1991 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 854 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14894 | |
| Common | 2865 | 16.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 1773 | 11.9% |
| o | 1434 | 9.6% |
| n | 1105 | 7.4% |
| a | 1041 | 7.0% |
| e | 976 | 6.6% |
| s | 880 | 5.9% |
| i | 864 | 5.8% |
| l | 730 | 4.9% |
| A | 467 | 3.1% |
| d | 462 | 3.1% |
| Other values (37) | 5162 |
Common
| Value | Count | Frequency (%) |
| (space) | 1991 | |
| . | 854 | |
| - | 20 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17758 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1991 | 11.2% |
| r | 1773 | 10.0% |
| o | 1434 | 8.1% |
| n | 1105 | 6.2% |
| a | 1041 | 5.9% |
| e | 976 | 5.5% |
| s | 880 | 5.0% |
| i | 864 | 4.9% |
| . | 854 | 4.8% |
| l | 730 | 4.1% |
| Other values (39) | 6110 |
None
| Value | Count | Frequency (%) |
| é | 1 |
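The category tables above can be approximated with Python's standard `unicodedata` module. This is a minimal sketch and not the profiler's own implementation: it tallies Unicode general categories over the five identifiedBy sample rows only, and the `CATEGORY_LABELS` mapping and both helper names are assumptions introduced for illustration.

```python
import unicodedata
from collections import Counter

# The five identifiedBy sample rows from the report (not the full column).
samples = [
    "Gary P. Aronsen",
    "Gary P. Aronsen",
    "Jos\u00e9 A. Ottenwalder",
    "Angus J. Mossman",
    "Angus J. Mossman",
]

# Assumed mapping from two-letter Unicode general categories
# to the labels that appear in the report's tables.
CATEGORY_LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Pd": "Dash Punctuation",
}


def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts


def non_ascii_chars(values):
    """Count characters outside the ASCII range (code point > 127)."""
    return Counter(ch for value in values for ch in value if ord(ch) > 127)


counts = category_counts(samples)
non_ascii = non_ascii_chars(samples)
print(counts)
print(non_ascii)
```

On the sample rows the only non-ASCII character is the é in "José", which matches the single entry the report lists outside the ASCII block.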
dateIdentified
Text
Missing 
| Distinct | 26 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 17913 |
| Missing (%) | 94.9% |
| Memory size | 147.5 KiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 5 |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 2016-01-01T00:00:00 |
|---|---|
| 2nd row | 2016-01-01T00:00:00 |
| 3rd row | 1985-01-01T00:00:00 |
| 4th row | 2016-01-01T00:00:00 |
| 5th row | 2016-01-01T00:00:00 |
| Value | Count | Frequency (%) |
| 2008-01-01t00:00:00 | 271 | |
| 2009-01-01t00:00:00 | 257 | |
| 2007-01-01t00:00:00 | 130 | |
| 2012-01-01t00:00:00 | 126 | |
| 2016-01-01t00:00:00 | 26 | 2.7% |
| 2011-01-01t00:00:00 | 22 | 2.3% |
| 2020-01-01t00:00:00 | 22 | 2.3% |
| 2010-01-01t00:00:00 | 22 | 2.3% |
| 2024-01-01t00:00:00 | 18 | 1.9% |
| 2023-01-01t00:00:00 | 15 | 1.6% |
| Other values (16) | 44 | 4.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9280 | |
| 1 | 2156 | 11.9% |
| - | 1906 | 10.5% |
| : | 1906 | 10.5% |
| 2 | 1137 | 6.3% |
| T | 953 | 5.3% |
| 8 | 276 | 1.5% |
| 9 | 274 | 1.5% |
| 7 | 132 | 0.7% |
| 6 | 33 | 0.2% |
| Other values (3) | 54 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13342 | |
| Dash Punctuation | 1906 | 10.5% |
| Other Punctuation | 1906 | 10.5% |
| Uppercase Letter | 953 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9280 | |
| 1 | 2156 | 16.2% |
| 2 | 1137 | 8.5% |
| 8 | 276 | 2.1% |
| 9 | 274 | 2.1% |
| 7 | 132 | 1.0% |
| 6 | 33 | 0.2% |
| 4 | 25 | 0.2% |
| 3 | 21 | 0.2% |
| 5 | 8 | 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1906 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1906 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 953 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17154 | |
| Latin | 953 | 5.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9280 | |
| 1 | 2156 | 12.6% |
| - | 1906 | 11.1% |
| : | 1906 | 11.1% |
| 2 | 1137 | 6.6% |
| 8 | 276 | 1.6% |
| 9 | 274 | 1.6% |
| 7 | 132 | 0.8% |
| 6 | 33 | 0.2% |
| 4 | 25 | 0.1% |
| Other values (2) | 29 | 0.2% |
Latin
| Value | Count | Frequency (%) |
| T | 953 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18107 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9280 | |
| 1 | 2156 | 11.9% |
| - | 1906 | 10.5% |
| : | 1906 | 10.5% |
| 2 | 1137 | 6.3% |
| T | 953 | 5.3% |
| 8 | 276 | 1.5% |
| 9 | 274 | 1.5% |
| 7 | 132 | 0.7% |
| 6 | 33 | 0.2% |
| Other values (3) | 54 | 0.3% |
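Every dateIdentified value has the fixed 19-character form `YYYY-01-01T00:00:00`, so the column effectively stores a year. The sketch below, which uses only the five sample rows and the counts printed in this report, parses the timestamps into years and re-derives the reported missing percentage. The helper names `year_counts` and `missing_percent` are hypothetical and not part of the profiler.

```python
from collections import Counter
from datetime import datetime

TOTAL_ROWS = 18866  # "Number of observations" from the report overview
MISSING = 17913     # "Missing" count reported for dateIdentified

# The five dateIdentified sample rows from the report (not the full column).
samples = [
    "2016-01-01T00:00:00",
    "2016-01-01T00:00:00",
    "1985-01-01T00:00:00",
    "2016-01-01T00:00:00",
    "2016-01-01T00:00:00",
]


def year_counts(values):
    """Parse ISO 8601 timestamps and count occurrences per year."""
    return Counter(datetime.fromisoformat(v).year for v in values)


def missing_percent(missing, total):
    """Share of missing cells as a percentage, rounded to one decimal."""
    return round(100 * missing / total, 1)


years = year_counts(samples)
pct = missing_percent(MISSING, TOTAL_ROWS)
present = TOTAL_ROWS - MISSING
print(years, pct, present)
```

The 953 non-missing rows agree with the count of `T` characters in the tables above, one per timestamp.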
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 18863 |
| Missing (%) | > 99.9% |
| Memory size | 147.5 KiB |
Length
| Max length | 57 |
|---|---|
| Median length | 6 |
| Mean length | 22.66666667 |
| Min length | 5 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | referenced on page 89 in the descripton of Agouti thomasi |
|---|---|
| 2nd row | Eaton |
| 3rd row | Thorpe |
| Value | Count | Frequency (%) |
| referenced | 1 | |
| on | 1 | |
| page | 1 | |
| 89 | 1 | |
| in | 1 | |
| the | 1 | |
| descripton | 1 | |
| of | 1 | |
| agouti | 1 | |
| thomasi | 1 | |
| Other values (2) | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 9 | |
| e | 8 | |
| o | 7 | |
| n | 5 | 7.4% |
| t | 5 | 7.4% |
| r | 4 | 5.9% |
| i | 4 | 5.9% |
| h | 3 | 4.4% |
| a | 3 | 4.4% |
| p | 3 | 4.4% |
| Other values (12) | 17 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 54 | |
| Space Separator | 9 | 13.2% |
| Uppercase Letter | 3 | 4.4% |
| Decimal Number | 2 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 8 | |
| o | 7 | |
| n | 5 | |
| t | 5 | |
| r | 4 | 7.4% |
| i | 4 | 7.4% |
| h | 3 | 5.6% |
| a | 3 | 5.6% |
| p | 3 | 5.6% |
| g | 2 | 3.7% |
| Other values (6) | 10 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| A | 1 | |
| T | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 1 | |
| 9 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 57 | |
| Common | 11 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 8 | |
| o | 7 | |
| n | 5 | 8.8% |
| t | 5 | 8.8% |
| r | 4 | 7.0% |
| i | 4 | 7.0% |
| h | 3 | 5.3% |
| a | 3 | 5.3% |
| p | 3 | 5.3% |
| g | 2 | 3.5% |
| Other values (9) | 13 |
Common
| Value | Count | Frequency (%) |
| (space) | 9 | |
| 8 | 1 | 9.1% |
| 9 | 1 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 9 | |
| e | 8 | |
| o | 7 | |
| n | 5 | 7.4% |
| t | 5 | 7.4% |
| r | 4 | 5.9% |
| i | 4 | 5.9% |
| h | 3 | 4.4% |
| a | 3 | 4.4% |
| p | 3 | 4.4% |
| Other values (12) | 17 |
scientificName
Text
| Distinct | 1854 |
|---|---|
| Distinct (%) | 9.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 110 |
|---|---|
| Median length | 59 |
| Mean length | 32.02750981 |
| Min length | 6 |
Unique
| Unique | 618 |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | Tamias striatus fisheri A.H.Howell, 1925 |
|---|---|
| 2nd row | Peromyscus leucopus (Rafinesque, 1818) |
| 3rd row | Peromyscus leucopus (Rafinesque, 1818) |
| 4th row | Peromyscus leucopus (Rafinesque, 1818) |
| 5th row | Peromyscus leucopus (Rafinesque, 1818) |
| Value | Count | Frequency (%) |
| linnaeus | 2062 | 2.9% |
| peromyscus | 1837 | 2.5% |
| 1758 | 1574 | 2.2% |
| 1830 | 1569 | 2.2% |
| cinereus | 1490 | 2.1% |
| sorex | 1193 | 1.7% |
| brevicauda | 1124 | 1.6% |
| blarina | 976 | 1.4% |
| zibethicus | 898 | 1.2% |
| talpoides | 867 | 1.2% |
| Other values (2496) | 58473 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 53197 | 8.8% |
| s | 44621 | 7.4% |
| a | 41941 | 6.9% |
| e | 40071 | 6.6% |
| i | 39594 | 6.6% |
| u | 33817 | 5.6% |
| r | 32580 | 5.4% |
| n | 27699 | 4.6% |
| o | 26491 | 4.4% |
| l | 20698 | 3.4% |
| Other values (65) | 243522 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 424685 | |
| Decimal Number | 56520 | 9.4% |
| Space Separator | 53197 | 8.8% |
| Uppercase Letter | 35563 | 5.9% |
| Other Punctuation | 16413 | 2.7% |
| Open Punctuation | 8827 | 1.5% |
| Close Punctuation | 8827 | 1.5% |
| Dash Punctuation | 199 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 44621 | |
| a | 41941 | |
| e | 40071 | |
| i | 39594 | |
| u | 33817 | 8.0% |
| r | 32580 | 7.7% |
| n | 27699 | 6.5% |
| o | 26491 | 6.2% |
| l | 20698 | 4.9% |
| c | 20108 | 4.7% |
| Other values (20) | 97065 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3596 | 10.1% |
| L | 3097 | 8.7% |
| S | 3064 | 8.6% |
| M | 3059 | 8.6% |
| C | 2876 | 8.1% |
| G | 2711 | 7.6% |
| B | 2540 | 7.1% |
| R | 1786 | 5.0% |
| T | 1753 | 4.9% |
| O | 1690 | 4.8% |
| Other values (17) | 9391 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 17134 | |
| 8 | 12257 | |
| 7 | 5671 | 10.0% |
| 9 | 4373 | 7.7% |
| 5 | 3669 | 6.5% |
| 0 | 3595 | 6.4% |
| 3 | 3256 | 5.8% |
| 6 | 2466 | 4.4% |
| 2 | 2350 | 4.2% |
| 4 | 1749 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 14230 | |
| . | 1751 | 10.7% |
| & | 427 | 2.6% |
| ' | 5 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 53197 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8827 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8827 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 199 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 460248 | |
| Common | 143983 | 23.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 44621 | 9.7% |
| a | 41941 | 9.1% |
| e | 40071 | 8.7% |
| i | 39594 | 8.6% |
| u | 33817 | 7.3% |
| r | 32580 | 7.1% |
| n | 27699 | 6.0% |
| o | 26491 | 5.8% |
| l | 20698 | 4.5% |
| c | 20108 | 4.4% |
| Other values (47) | 132628 |
Common
| Value | Count | Frequency (%) |
| (space) | 53197 | |
| 1 | 17134 | 11.9% |
| , | 14230 | 9.9% |
| 8 | 12257 | 8.5% |
| ( | 8827 | 6.1% |
| ) | 8827 | 6.1% |
| 7 | 5671 | 3.9% |
| 9 | 4373 | 3.0% |
| 5 | 3669 | 2.5% |
| 0 | 3595 | 2.5% |
| Other values (8) | 12203 | 8.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 603778 | |
| None | 453 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 53197 | 8.8% |
| s | 44621 | 7.4% |
| a | 41941 | 6.9% |
| e | 40071 | 6.6% |
| i | 39594 | 6.6% |
| u | 33817 | 5.6% |
| r | 32580 | 5.4% |
| n | 27699 | 4.6% |
| o | 26491 | 4.4% |
| l | 20698 | 3.4% |
| Other values (60) | 243069 |
None
| Value | Count | Frequency (%) |
| É | 250 | |
| ü | 136 | |
| è | 28 | 6.2% |
| é | 25 | 5.5% |
| ö | 14 | 3.1% |
| Distinct | 256 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 153 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 231 |
|---|---|
| Median length | 222 |
| Mean length | 176.5778336 |
| Min length | 30 |
Unique
| Unique | 26 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Animalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Sciuromorpha; Sciurida; Sciuridae; Xerinae |
|---|---|
| 2nd row | Animalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae |
| 3rd row | Animalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae |
| 4th row | Animalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae |
| 5th row | Animalia; Chordata; Vertebrata; Amniota; Mammalia; Theriiformes-----Theria-Placentalia-Epitheria; Preptotheria-Anagalida-Simplicidentata; Rodentia; Myomorpha; Myodonta; Muroidea; Cricetidae; Neotominae |
| Value | Count | Frequency (%) |
| animalia | 18713 | 8.8% |
| vertebrata | 18713 | 8.8% |
| chordata | 18713 | 8.8% |
| amniota | 18711 | 8.8% |
| mammalia | 18711 | 8.8% |
| theriiformes-----theria-placentalia-epitheria | 15223 | 7.1% |
| rodentia | 8426 | 3.9% |
| preptotheria-anagalida-simplicidentata | 8425 | 3.9% |
| myomorpha | 5919 | 2.8% |
| myodonta | 5717 | 2.7% |
| Other values (374) | 76277 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 452709 | |
| i | 335054 | 10.1% |
| e | 250563 | 7.6% |
| r | 228161 | 6.9% |
| t | 207108 | 6.3% |
| ; | 194835 | 5.9% |
| (space) | 194835 | 5.9% |
| o | 167342 | 5.1% |
| - | 154910 | 4.7% |
| n | 124331 | 3.8% |
| Other values (40) | 994453 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2464975 | |
| Uppercase Letter | 294746 | 8.9% |
| Other Punctuation | 194835 | 5.9% |
| Space Separator | 194835 | 5.9% |
| Dash Punctuation | 154910 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 452709 | |
| i | 335054 | |
| e | 250563 | |
| r | 228161 | |
| t | 207108 | |
| o | 167342 | 6.8% |
| n | 124331 | 5.0% |
| m | 121823 | 4.9% |
| l | 112560 | 4.6% |
| h | 109691 | 4.4% |
| Other values (14) | 355633 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 54733 | |
| M | 41057 | |
| T | 37832 | |
| P | 36866 | |
| C | 34091 | |
| E | 20860 | 7.1% |
| S | 20718 | 7.0% |
| V | 19658 | 6.7% |
| R | 9908 | 3.4% |
| F | 3662 | 1.2% |
| Other values (13) | 15361 | 5.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 194835 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 194835 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 154910 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2759721 | |
| Common | 544580 | 16.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 452709 | |
| i | 335054 | |
| e | 250563 | 9.1% |
| r | 228161 | 8.3% |
| t | 207108 | 7.5% |
| o | 167342 | 6.1% |
| n | 124331 | 4.5% |
| m | 121823 | 4.4% |
| l | 112560 | 4.1% |
| h | 109691 | 4.0% |
| Other values (37) | 650379 |
Common
| Value | Count | Frequency (%) |
| ; | 194835 | |
| (space) | 194835 | |
| - | 154910 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3304301 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 452709 | |
| i | 335054 | 10.1% |
| e | 250563 | 7.6% |
| r | 228161 | 6.9% |
| t | 207108 | 6.3% |
| ; | 194835 | 5.9% |
| (space) | 194835 | 5.9% |
| o | 167342 | 5.1% |
| - | 154910 | 4.7% |
| n | 124331 | 3.8% |
| Other values (40) | 994453 |
kingdom
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 8.048340931 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 18714 | |
| incertae | 152 | 0.8% |
| sedis | 152 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 37732 | |
| a | 37580 | |
| n | 18866 | |
| A | 18714 | |
| m | 18714 | |
| l | 18714 | |
| e | 456 | 0.3% |
| s | 304 | 0.2% |
| c | 152 | 0.1% |
| r | 152 | 0.1% |
| Other values (3) | 456 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 132974 | |
| Uppercase Letter | 18714 | 12.3% |
| Space Separator | 152 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 37732 | |
| a | 37580 | |
| n | 18866 | |
| m | 18714 | |
| l | 18714 | |
| e | 456 | 0.3% |
| s | 304 | 0.2% |
| c | 152 | 0.1% |
| r | 152 | 0.1% |
| t | 152 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 18714 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 152 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 151688 | |
| Common | 152 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 37732 | |
| a | 37580 | |
| n | 18866 | |
| A | 18714 | |
| m | 18714 | |
| l | 18714 | |
| e | 456 | 0.3% |
| s | 304 | 0.2% |
| c | 152 | 0.1% |
| r | 152 | 0.1% |
| Other values (2) | 304 | 0.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 152 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 151840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 37732 | |
| a | 37580 | |
| n | 18866 | |
| A | 18714 | |
| m | 18714 | |
| l | 18714 | |
| e | 456 | 0.3% |
| s | 304 | 0.2% |
| c | 152 | 0.1% |
| r | 152 | 0.1% |
| Other values (3) | 456 | 0.3% |
phylum
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 152 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| 3rd row | Chordata |
| 4th row | Chordata |
| 5th row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 18714 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 37428 | |
| C | 18714 | |
| h | 18714 | |
| o | 18714 | |
| r | 18714 | |
| d | 18714 | |
| t | 18714 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 130998 | |
| Uppercase Letter | 18714 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 37428 | |
| h | 18714 | |
| o | 18714 | |
| r | 18714 | |
| d | 18714 | |
| t | 18714 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 18714 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 149712 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 37428 | |
| C | 18714 | |
| h | 18714 | |
| o | 18714 | |
| r | 18714 | |
| d | 18714 | |
| t | 18714 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 149712 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 37428 | |
| C | 18714 | |
| h | 18714 | |
| o | 18714 | |
| r | 18714 | |
| d | 18714 | |
| t | 18714 |
class
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 154 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Mammalia |
|---|---|
| 2nd row | Mammalia |
| 3rd row | Mammalia |
| 4th row | Mammalia |
| 5th row | Mammalia |
| Value | Count | Frequency (%) |
| mammalia | 18712 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 56136 | |
| m | 37424 | |
| M | 18712 | 12.5% |
| l | 18712 | 12.5% |
| i | 18712 | 12.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 130984 | |
| Uppercase Letter | 18712 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 56136 | |
| m | 37424 | |
| l | 18712 | 14.3% |
| i | 18712 | 14.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 18712 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 149696 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 56136 | |
| m | 37424 | |
| M | 18712 | 12.5% |
| l | 18712 | 12.5% |
| i | 18712 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 149696 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 56136 | |
| m | 37424 | |
| M | 18712 | 12.5% |
| l | 18712 | 12.5% |
| i | 18712 | 12.5% |
order
Text
Missing 
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 406 |
| Missing (%) | 2.2% |
| Memory size | 147.5 KiB |
Length
| Max length | 16 |
|---|---|
| Median length | 8 |
| Mean length | 9.43624052 |
| Min length | 6 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Rodentia |
|---|---|
| 2nd row | Rodentia |
| 3rd row | Rodentia |
| 4th row | Rodentia |
| 5th row | Rodentia |
| Value | Count | Frequency (%) |
| rodentia | 8426 | |
| soricomorpha | 2476 | 13.4% |
| carnivora | 2371 | 12.8% |
| artiodactyla | 1529 | 8.3% |
| chiroptera | 1102 | 6.0% |
| primates | 953 | 5.2% |
| lagomorpha | 348 | 1.9% |
| diprotodontia | 248 | 1.3% |
| didelphimorphia | 213 | 1.2% |
| perissodactyla | 157 | 0.9% |
| Other values (17) | 637 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 23323 | |
| o | 23289 | |
| i | 18719 | |
| r | 15878 | |
| t | 14533 | |
| e | 11474 | 6.6% |
| n | 11293 | 6.5% |
| d | 10796 | 6.2% |
| R | 8426 | 4.8% |
| p | 4723 | 2.7% |
| Other values (22) | 31739 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 155733 | |
| Uppercase Letter | 18460 | 10.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 23323 | |
| o | 23289 | |
| i | 18719 | |
| r | 15878 | |
| t | 14533 | |
| e | 11474 | |
| n | 11293 | |
| d | 10796 | |
| p | 4723 | 3.0% |
| c | 4545 | 2.9% |
| Other values (10) | 17160 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 8426 | |
| C | 3679 | |
| S | 2507 | 13.6% |
| A | 1588 | 8.6% |
| P | 1254 | 6.8% |
| D | 495 | 2.7% |
| L | 348 | 1.9% |
| M | 87 | 0.5% |
| E | 41 | 0.2% |
| H | 29 | 0.2% |
| Other values (2) | 6 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 174193 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 23323 | |
| o | 23289 | |
| i | 18719 | |
| r | 15878 | |
| t | 14533 | |
| e | 11474 | 6.6% |
| n | 11293 | 6.5% |
| d | 10796 | 6.2% |
| R | 8426 | 4.8% |
| p | 4723 | 2.7% |
| Other values (22) | 31739 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 174193 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 23323 | |
| o | 23289 | |
| i | 18719 | |
| r | 15878 | |
| t | 14533 | |
| e | 11474 | 6.6% |
| n | 11293 | 6.5% |
| d | 10796 | 6.2% |
| R | 8426 | 4.8% |
| p | 4723 | 2.7% |
| Other values (22) | 31739 |
family
Text
Missing 
| Distinct | 134 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 684 |
| Missing (%) | 3.6% |
| Memory size | 147.5 KiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 9.657463425 |
| Min length | 6 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sciuridae |
|---|---|
| 2nd row | Cricetidae |
| 3rd row | Cricetidae |
| 4th row | Cricetidae |
| 5th row | Cricetidae |
| Value | Count | Frequency (%) |
| cricetidae | 4133 | |
| soricidae | 2286 | 12.6% |
| sciuridae | 1673 | 9.2% |
| muridae | 1068 | 5.9% |
| bovidae | 837 | 4.6% |
| canidae | 662 | 3.6% |
| dipodidae | 459 | 2.5% |
| mustelidae | 440 | 2.4% |
| cercopithecidae | 421 | 2.3% |
| vespertilionidae | 405 | 2.2% |
| Other values (124) | 5798 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 29346 | |
| e | 27872 | |
| a | 20776 | |
| d | 19527 | |
| r | 13293 | |
| c | 10178 | 5.8% |
| o | 8753 | 5.0% |
| t | 7151 | 4.1% |
| C | 6000 | 3.4% |
| S | 4067 | 2.3% |
| Other values (32) | 28629 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 157410 | |
| Uppercase Letter | 18182 | 10.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 29346 | |
| e | 27872 | |
| a | 20776 | |
| d | 19527 | |
| r | 13293 | |
| c | 10178 | 6.5% |
| o | 8753 | 5.6% |
| t | 7151 | 4.5% |
| u | 3652 | 2.3% |
| l | 3087 | 2.0% |
| Other values (12) | 13775 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 6000 | |
| S | 4067 | |
| M | 1930 | 10.6% |
| P | 1153 | 6.3% |
| D | 866 | 4.8% |
| B | 863 | 4.7% |
| H | 485 | 2.7% |
| V | 476 | 2.6% |
| L | 448 | 2.5% |
| F | 391 | 2.2% |
| Other values (10) | 1503 | 8.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 175592 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 29346 | |
| e | 27872 | |
| a | 20776 | |
| d | 19527 | |
| r | 13293 | |
| c | 10178 | 5.8% |
| o | 8753 | 5.0% |
| t | 7151 | 4.1% |
| C | 6000 | 3.4% |
| S | 4067 | 2.3% |
| Other values (32) | 28629 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 175592 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 29346 | |
| e | 27872 | |
| a | 20776 | |
| d | 19527 | |
| r | 13293 | |
| c | 10178 | 5.8% |
| o | 8753 | 5.0% |
| t | 7151 | 4.1% |
| C | 6000 | 3.4% |
| S | 4067 | 2.3% |
| Other values (32) | 28629 |
genus
Text
Missing 
| Distinct | 610 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 1248 |
| Missing (%) | 6.6% |
| Memory size | 147.5 KiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 7.820581224 |
| Min length | 3 |
Unique
| Unique | 99 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Tamias |
|---|---|
| 2nd row | Peromyscus |
| 3rd row | Peromyscus |
| 4th row | Peromyscus |
| 5th row | Peromyscus |
| Value | Count | Frequency (%) |
| peromyscus | 1837 | 10.4% |
| sorex | 1183 | 6.7% |
| blarina | 976 | 5.5% |
| myodes | 742 | 4.2% |
| ondatra | 631 | 3.6% |
| microtus | 430 | 2.4% |
| tamias | 398 | 2.3% |
| napaeozapus | 365 | 2.1% |
| canis | 345 | 2.0% |
| procyon | 329 | 1.9% |
| Other values (600) | 10382 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 15209 | 11.0% |
| o | 12469 | 9.0% |
| a | 11828 | 8.6% |
| r | 10473 | 7.6% |
| e | 9289 | 6.7% |
| u | 9207 | 6.7% |
| i | 7477 | 5.4% |
| c | 6205 | 4.5% |
| y | 5919 | 4.3% |
| t | 5122 | 3.7% |
| Other values (38) | 44585 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 120165 | |
| Uppercase Letter | 17618 | 12.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 15209 | |
| o | 12469 | |
| a | 11828 | |
| r | 10473 | 8.7% |
| e | 9289 | 7.7% |
| u | 9207 | 7.7% |
| i | 7477 | 6.2% |
| c | 6205 | 5.2% |
| y | 5919 | 4.9% |
| t | 5122 | 4.3% |
| Other values (15) | 26967 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3036 | |
| M | 2506 | |
| S | 1903 | |
| C | 1609 | |
| O | 1280 | |
| T | 1205 | 6.8% |
| B | 1189 | 6.7% |
| L | 659 | 3.7% |
| N | 645 | 3.7% |
| A | 595 | 3.4% |
| Other values (13) | 2991 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 137783 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 15209 | 11.0% |
| o | 12469 | 9.0% |
| a | 11828 | 8.6% |
| r | 10473 | 7.6% |
| e | 9289 | 6.7% |
| u | 9207 | 6.7% |
| i | 7477 | 5.4% |
| c | 6205 | 4.5% |
| y | 5919 | 4.3% |
| t | 5122 | 3.7% |
| Other values (38) | 44585 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 137783 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 15209 | 11.0% |
| o | 12469 | 9.0% |
| a | 11828 | 8.6% |
| r | 10473 | 7.6% |
| e | 9289 | 6.7% |
| u | 9207 | 6.7% |
| i | 7477 | 5.4% |
| c | 6205 | 4.5% |
| y | 5919 | 4.3% |
| t | 5122 | 3.7% |
| Other values (38) | 44585 |
genericName
Text
Missing 
| Distinct | 610 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 1248 |
| Missing (%) | 6.6% |
| Memory size | 147.5 KiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 8.097911227 |
| Min length | 3 |
Unique
| Unique | 109 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Tamias |
|---|---|
| 2nd row | Peromyscus |
| 3rd row | Peromyscus |
| 4th row | Peromyscus |
| 5th row | Peromyscus |
| Value | Count | Frequency (%) |
| peromyscus | 1837 | 10.4% |
| sorex | 1193 | 6.8% |
| blarina | 976 | 5.5% |
| clethrionomys | 742 | 4.2% |
| ondatra | 631 | 3.6% |
| microtus | 434 | 2.5% |
| tamias | 398 | 2.3% |
| napaeozapus | 365 | 2.1% |
| canis | 345 | 2.0% |
| procyon | 329 | 1.9% |
| Other values (600) | 10368 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 15048 | 10.5% |
| o | 13419 | 9.4% |
| a | 11754 | 8.2% |
| r | 11226 | 7.9% |
| e | 9339 | 6.5% |
| u | 9072 | 6.4% |
| i | 8183 | 5.7% |
| c | 6202 | 4.3% |
| y | 5916 | 4.1% |
| m | 5689 | 4.0% |
| Other values (37) | 46821 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 125051 | |
| Uppercase Letter | 17618 | 12.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 15048 | |
| o | 13419 | |
| a | 11754 | |
| r | 11226 | 9.0% |
| e | 9339 | 7.5% |
| u | 9072 | 7.3% |
| i | 8183 | 6.5% |
| c | 6202 | 5.0% |
| y | 5916 | 4.7% |
| m | 5689 | 4.5% |
| Other values (14) | 29203 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3033 | |
| C | 2307 | |
| S | 1906 | |
| M | 1595 | |
| O | 1307 | |
| T | 1213 | 6.9% |
| B | 1189 | 6.7% |
| N | 830 | 4.7% |
| L | 660 | 3.7% |
| A | 584 | 3.3% |
| Other values (13) | 2994 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 142669 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 15048 | 10.5% |
| o | 13419 | 9.4% |
| a | 11754 | 8.2% |
| r | 11226 | 7.9% |
| e | 9339 | 6.5% |
| u | 9072 | 6.4% |
| i | 8183 | 5.7% |
| c | 6202 | 4.3% |
| y | 5916 | 4.1% |
| m | 5689 | 4.0% |
| Other values (37) | 46821 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 142669 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 15048 | 10.5% |
| o | 13419 | 9.4% |
| a | 11754 | 8.2% |
| r | 11226 | 7.9% |
| e | 9339 | 6.5% |
| u | 9072 | 6.4% |
| i | 8183 | 5.7% |
| c | 6202 | 4.3% |
| y | 5916 | 4.1% |
| m | 5689 | 4.0% |
| Other values (37) | 46821 |
specificEpithet
Text
Missing 
| Distinct | 949 |
|---|---|
| Distinct (%) | 5.8% |
| Missing | 2554 |
| Missing (%) | 13.5% |
| Memory size | 147.5 KiB |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 8.579144188 |
| Min length | 2 |
Unique
| Unique | 226 |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | striatus |
|---|---|
| 2nd row | leucopus |
| 3rd row | leucopus |
| 4th row | leucopus |
| 5th row | leucopus |
| Value | Count | Frequency (%) |
| brevicauda | 986 | 6.0% |
| leucopus | 775 | 4.8% |
| cinereus | 747 | 4.6% |
| gapperi | 708 | 4.3% |
| maniculatus | 683 | 4.2% |
| zibethicus | 631 | 3.9% |
| insignis | 365 | 2.2% |
| lotor | 326 | 2.0% |
| canadensis | 320 | 2.0% |
| hudsonicus | 292 | 1.8% |
| Other values (939) | 10479 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 16095 | |
| s | 14920 | |
| u | 14661 | |
| a | 13492 | |
| e | 10462 | 7.5% |
| n | 9056 | 6.5% |
| r | 8917 | 6.4% |
| c | 8678 | 6.2% |
| l | 6251 | 4.5% |
| t | 5825 | 4.2% |
| Other values (17) | 31586 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 139941 | |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 16095 | |
| s | 14920 | |
| u | 14661 | |
| a | 13492 | |
| e | 10462 | 7.5% |
| n | 9056 | 6.5% |
| r | 8917 | 6.4% |
| c | 8678 | 6.2% |
| l | 6251 | 4.5% |
| t | 5825 | 4.2% |
| Other values (16) | 31584 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 139941 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 16095 | |
| s | 14920 | |
| u | 14661 | |
| a | 13492 | |
| e | 10462 | 7.5% |
| n | 9056 | 6.5% |
| r | 8917 | 6.4% |
| c | 8678 | 6.2% |
| l | 6251 | 4.5% |
| t | 5825 | 4.2% |
| Other values (16) | 31584 |
Common
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 139943 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 16095 | |
| s | 14920 | |
| u | 14661 | |
| a | 13492 | |
| e | 10462 | 7.5% |
| n | 9056 | 6.5% |
| r | 8917 | 6.4% |
| c | 8678 | 6.2% |
| l | 6251 | 4.5% |
| t | 5825 | 4.2% |
| Other values (17) | 31586 |
Missing 
| Distinct | 583 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 11638 |
| Missing (%) | 61.7% |
| Memory size | 147.5 KiB |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 8.710154953 |
| Min length | 3 |
Unique
| Unique | 203 |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | fisheri |
|---|---|
| 2nd row | domesticus |
| 3rd row | domesticus |
| 4th row | domesticus |
| 5th row | domesticus |
| Value | Count | Frequency (%) |
| talpoides | 835 | 11.6% |
| cinereus | 743 | 10.3% |
| pennsylvanicus | 303 | 4.2% |
| fumeus | 275 | 3.8% |
| zibethicus | 267 | 3.7% |
| domesticus | 193 | 2.7% |
| lucifugus | 155 | 2.1% |
| maniculatus | 146 | 2.0% |
| brevicauda | 138 | 1.9% |
| fulvus | 119 | 1.6% |
| Other values (573) | 4054 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 7421 | |
| i | 6875 | |
| e | 6109 | |
| u | 5693 | |
| a | 5422 | 8.6% |
| n | 4224 | 6.7% |
| c | 3632 | 5.8% |
| l | 3386 | 5.4% |
| r | 3314 | 5.3% |
| t | 3035 | 4.8% |
| Other values (16) | 13846 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 62957 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 7421 | |
| i | 6875 | |
| e | 6109 | |
| u | 5693 | |
| a | 5422 | 8.6% |
| n | 4224 | 6.7% |
| c | 3632 | 5.8% |
| l | 3386 | 5.4% |
| r | 3314 | 5.3% |
| t | 3035 | 4.8% |
| Other values (16) | 13846 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 62957 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 7421 | |
| i | 6875 | |
| e | 6109 | |
| u | 5693 | |
| a | 5422 | 8.6% |
| n | 4224 | 6.7% |
| c | 3632 | 5.8% |
| l | 3386 | 5.4% |
| r | 3314 | 5.3% |
| t | 3035 | 4.8% |
| Other values (16) | 13846 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62957 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 7421 | |
| i | 6875 | |
| e | 6109 | |
| u | 5693 | |
| a | 5422 | 8.6% |
| n | 4224 | 6.7% |
| c | 3632 | 5.8% |
| l | 3386 | 5.4% |
| r | 3314 | 5.3% |
| t | 3035 | 4.8% |
| Other values (16) | 13846 |
taxonRank
Text
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 7.924732323 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SUBSPECIES |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 9084 | 48.2% |
| subspecies | 7228 | 38.3% |
| genus | 1306 | 6.9% |
| family | 564 | 3.0% |
| order | 282 | 1.5% |
| class | 248 | 1.3% |
| kingdom | 152 | 0.8% |
| phylum | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 41654 | |
| E | 34212 | |
| I | 17028 | |
| C | 16560 | 11.1% |
| P | 16314 | 10.9% |
| U | 8536 | 5.7% |
| B | 7228 | 4.8% |
| G | 1458 | 1.0% |
| N | 1458 | 1.0% |
| L | 814 | 0.5% |
| Other values (9) | 4246 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 149508 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 41654 | |
| E | 34212 | |
| I | 17028 | |
| C | 16560 | 11.1% |
| P | 16314 | 10.9% |
| U | 8536 | 5.7% |
| B | 7228 | 4.8% |
| G | 1458 | 1.0% |
| N | 1458 | 1.0% |
| L | 814 | 0.5% |
| Other values (9) | 4246 | 2.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 149508 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 41654 | |
| E | 34212 | |
| I | 17028 | |
| C | 16560 | 11.1% |
| P | 16314 | 10.9% |
| U | 8536 | 5.7% |
| B | 7228 | 4.8% |
| G | 1458 | 1.0% |
| N | 1458 | 1.0% |
| L | 814 | 0.5% |
| Other values (9) | 4246 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 149508 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 41654 | |
| E | 34212 | |
| I | 17028 | |
| C | 16560 | 11.1% |
| P | 16314 | 10.9% |
| U | 8536 | 5.7% |
| B | 7228 | 4.8% |
| G | 1458 | 1.0% |
| N | 1458 | 1.0% |
| L | 814 | 0.5% |
| Other values (9) | 4246 | 2.8% |
vernacularName
Text
| Distinct | 1166 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 153 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 143 |
|---|---|
| Median length | 121 |
| Mean length | 82.7007428 |
| Min length | 31 |
Unique
| Unique | 294 |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | Eastern Chipmunk; chipmunks; squirrels; rodents; mammals; vertebrates; chordates; animals |
|---|---|
| 2nd row | White-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals |
| 3rd row | White-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals |
| 4th row | White-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals |
| 5th row | White-footed Mouse; mice; rodents; mammals; vertebrates; chordates; animals |
| Value | Count | Frequency (%) |
| mammals | 18748 | 11.1% |
| vertebrates | 18713 | 11.1% |
| chordates | 18713 | 11.1% |
| animals | 18713 | 11.1% |
| rodents | 8561 | 5.1% |
| mice | 7296 | 4.3% |
| carnivores | 4733 | 2.8% |
| shrews | 3336 | 2.0% |
| mouse | 2787 | 1.7% |
| squirrels | 2585 | 1.5% |
| Other values (1028) | 64018 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 152163 | 9.8% |
| e | 149527 | 9.7% |
| (space) | 149490 | 9.7% |
| s | 133715 | 8.6% |
| ; | 118068 | 7.6% |
| r | 116349 | 7.5% |
| t | 95576 | 6.2% |
| m | 94542 | 6.1% |
| o | 69125 | 4.5% |
| l | 60718 | 3.9% |
| Other values (50) | 408306 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1230864 | |
| Space Separator | 149490 | 9.7% |
| Other Punctuation | 118585 | 7.7% |
| Uppercase Letter | 39691 | 2.6% |
| Dash Punctuation | 8820 | 0.6% |
| Final Punctuation | 128 | < 0.1% |
| Control | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 152163 | |
| e | 149527 | |
| s | 133715 | |
| r | 116349 | |
| t | 95576 | 7.8% |
| m | 94542 | 7.7% |
| o | 69125 | 5.6% |
| l | 60718 | 4.9% |
| i | 59895 | 4.9% |
| n | 53167 | 4.3% |
| Other values (17) | 246087 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 6525 | |
| M | 5563 | |
| W | 3144 | 7.9% |
| R | 2708 | 6.8% |
| B | 2683 | 6.8% |
| A | 2449 | 6.2% |
| N | 2218 | 5.6% |
| C | 1850 | 4.7% |
| G | 1827 | 4.6% |
| V | 1298 | 3.3% |
| Other values (15) | 9426 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 118068 | |
| ' | 509 | 0.4% |
| . | 4 | < 0.1% |
| ? | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 149490 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8820 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 128 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1270555 | |
| Common | 277024 | 17.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 152163 | |
| e | 149527 | |
| s | 133715 | |
| r | 116349 | |
| t | 95576 | 7.5% |
| m | 94542 | 7.4% |
| o | 69125 | 5.4% |
| l | 60718 | 4.8% |
| i | 59895 | 4.7% |
| n | 53167 | 4.2% |
| Other values (42) | 285778 |
Common
| Value | Count | Frequency (%) |
| (space) | 149490 | |
| ; | 118068 | |
| - | 8820 | 3.2% |
| ' | 509 | 0.2% |
| ’ | 128 | < 0.1% |
| . | 4 | < 0.1% |
| ? | 4 | < 0.1% |
| | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1547439 | |
| Punctuation | 128 | < 0.1% |
| None | 12 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 152163 | 9.8% |
| e | 149527 | 9.7% |
| (space) | 149490 | 9.7% |
| s | 133715 | 8.6% |
| ; | 118068 | 7.6% |
| r | 116349 | 7.5% |
| t | 95576 | 6.2% |
| m | 94542 | 6.1% |
| o | 69125 | 4.5% |
| l | 60718 | 3.9% |
| Other values (47) | 408166 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 128 |
None
| Value | Count | Frequency (%) |
| ü | 11 | |
| | 1 | 8.3% |
nomenclaturalCode
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ICZN |
|---|---|
| 2nd row | ICZN |
| 3rd row | ICZN |
| 4th row | ICZN |
| 5th row | ICZN |
| Value | Count | Frequency (%) |
| iczn | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| I | 18866 | |
| C | 18866 | |
| Z | 18866 | |
| N | 18866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 75464 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 18866 | |
| C | 18866 | |
| Z | 18866 | |
| N | 18866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 75464 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 18866 | |
| C | 18866 | |
| Z | 18866 | |
| N | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 75464 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| I | 18866 | |
| C | 18866 | |
| Z | 18866 | |
| N | 18866 |
taxonomicStatus
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 152 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.848722881 |
| Min length | 7 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ACCEPTED |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | ACCEPTED |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 15796 | 84.4% |
| synonym | 2831 | 15.1% |
| doubtful | 87 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 31592 | |
| E | 31592 | |
| T | 15883 | |
| D | 15883 | |
| A | 15796 | |
| P | 15796 | |
| Y | 5662 | 3.9% |
| N | 5662 | 3.9% |
| O | 2918 | 2.0% |
| S | 2831 | 1.9% |
| Other values (5) | 3266 | 2.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 146881 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 31592 | |
| E | 31592 | |
| T | 15883 | |
| D | 15883 | |
| A | 15796 | |
| P | 15796 | |
| Y | 5662 | 3.9% |
| N | 5662 | 3.9% |
| O | 2918 | 2.0% |
| S | 2831 | 1.9% |
| Other values (5) | 3266 | 2.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 146881 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 31592 | |
| E | 31592 | |
| T | 15883 | |
| D | 15883 | |
| A | 15796 | |
| P | 15796 | |
| Y | 5662 | 3.9% |
| N | 5662 | 3.9% |
| O | 2918 | 2.0% |
| S | 2831 | 1.9% |
| Other values (5) | 3266 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 146881 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 31592 | |
| E | 31592 | |
| T | 15883 | |
| D | 15883 | |
| A | 15796 | |
| P | 15796 | |
| Y | 5662 | 3.9% |
| N | 5662 | 3.9% |
| O | 2918 | 2.0% |
| S | 2831 | 1.9% |
| Other values (5) | 3266 | 2.2% |
taxonRemarks
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 41 |
|---|---|
| Median length | 41 |
| Mean length | 41 |
| Min length | 41 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animals and Plants: Vertebrates - Mammals |
|---|---|
| 2nd row | Animals and Plants: Vertebrates - Mammals |
| 3rd row | Animals and Plants: Vertebrates - Mammals |
| 4th row | Animals and Plants: Vertebrates - Mammals |
| 5th row | Animals and Plants: Vertebrates - Mammals |
| Value | Count | Frequency (%) |
| animals | 18866 | |
| and | 18866 | |
| plants | 18866 | |
| vertebrates | 18866 | |
| 18866 | ||
| mammals | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 113196 | |
| (space) | 94330 | |
| s | 75464 | |
| e | 56598 | 7.3% |
| m | 56598 | 7.3% |
| l | 56598 | 7.3% |
| n | 56598 | 7.3% |
| t | 56598 | 7.3% |
| r | 37732 | 4.9% |
| A | 18866 | 2.4% |
| Other values (8) | 150928 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 565980 | |
| Space Separator | 94330 | 12.2% |
| Uppercase Letter | 75464 | 9.8% |
| Dash Punctuation | 18866 | 2.4% |
| Other Punctuation | 18866 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 113196 | |
| s | 75464 | |
| e | 56598 | |
| m | 56598 | |
| l | 56598 | |
| n | 56598 | |
| t | 56598 | |
| r | 37732 | 6.7% |
| b | 18866 | 3.3% |
| d | 18866 | 3.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 18866 | |
| P | 18866 | |
| V | 18866 | |
| M | 18866 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 94330 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 18866 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 18866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 641444 | |
| Common | 132062 | 17.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 113196 | |
| s | 75464 | |
| e | 56598 | |
| m | 56598 | |
| l | 56598 | |
| n | 56598 | |
| t | 56598 | |
| r | 37732 | 5.9% |
| A | 18866 | 2.9% |
| b | 18866 | 2.9% |
| Other values (5) | 94330 |
Common
| Value | Count | Frequency (%) |
| (space) | 94330 | |
| - | 18866 | 14.3% |
| : | 18866 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 773506 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 113196 | |
| (space) | 94330 | |
| s | 75464 | |
| e | 56598 | 7.3% |
| m | 56598 | 7.3% |
| l | 56598 | 7.3% |
| n | 56598 | 7.3% |
| t | 56598 | 7.3% |
| r | 37732 | 4.9% |
| A | 18866 | 2.4% |
| Other values (8) | 150928 |
datasetKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 854f602e-f762-11e1-a439-00145eb45e9a |
|---|---|
| 2nd row | 854f602e-f762-11e1-a439-00145eb45e9a |
| 3rd row | 854f602e-f762-11e1-a439-00145eb45e9a |
| 4th row | 854f602e-f762-11e1-a439-00145eb45e9a |
| 5th row | 854f602e-f762-11e1-a439-00145eb45e9a |
| Value | Count | Frequency (%) |
| 854f602e-f762-11e1-a439-00145eb45e9a | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 75464 | |
| e | 75464 | |
| - | 75464 | |
| 1 | 75464 | |
| 5 | 56598 | |
| 0 | 56598 | |
| f | 37732 | 5.6% |
| 6 | 37732 | 5.6% |
| 2 | 37732 | 5.6% |
| a | 37732 | 5.6% |
| Other values (5) | 113196 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 433918 | |
| Lowercase Letter | 169794 | 25.0% |
| Dash Punctuation | 75464 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 75464 | |
| 1 | 75464 | |
| 5 | 56598 | |
| 0 | 56598 | |
| 6 | 37732 | |
| 2 | 37732 | |
| 9 | 37732 | |
| 8 | 18866 | 4.3% |
| 7 | 18866 | 4.3% |
| 3 | 18866 | 4.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 75464 | |
| f | 37732 | |
| a | 37732 | |
| b | 18866 | 11.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 75464 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 509382 | |
| Latin | 169794 | 25.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 75464 | |
| - | 75464 | |
| 1 | 75464 | |
| 5 | 56598 | |
| 0 | 56598 | |
| 6 | 37732 | |
| 2 | 37732 | |
| 9 | 37732 | |
| 8 | 18866 | 3.7% |
| 7 | 18866 | 3.7% |
Latin
| Value | Count | Frequency (%) |
| e | 75464 | |
| f | 37732 | |
| a | 37732 | |
| b | 18866 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 679176 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 75464 | |
| e | 75464 | |
| - | 75464 | |
| 1 | 75464 | |
| 5 | 56598 | |
| 0 | 56598 | |
| f | 37732 | 5.6% |
| 6 | 37732 | 5.6% |
| 2 | 37732 | 5.6% |
| a | 37732 | 5.6% |
| Other values (5) | 113196 |
publishingCountry
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 18866 | |
| S | 18866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 37732 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 18866 | |
| S | 18866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37732 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 18866 | |
| S | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37732 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 18866 | |
| S | 18866 |
lastInterpreted
Text
| Distinct | 4639 |
|---|---|
| Distinct (%) | 24.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.9970317 |
| Min length | 20 |
Unique
| Unique | 411 |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | 2025-01-08T13:41:37.071Z |
|---|---|
| 2nd row | 2025-01-08T13:41:37.575Z |
| 3rd row | 2025-01-08T13:41:36.570Z |
| 4th row | 2025-01-08T13:41:33.336Z |
| 5th row | 2025-01-08T13:41:31.987Z |
| Value | Count | Frequency (%) |
| 2025-01-08t13:41:34.959z | 13 | 0.1% |
| 2025-01-08t13:41:37.272z | 12 | 0.1% |
| 2025-01-08t13:41:35.416z | 12 | 0.1% |
| 2025-01-08t13:41:37.511z | 11 | 0.1% |
| 2025-01-08t13:41:37.284z | 11 | 0.1% |
| 2025-01-08t13:41:35.688z | 11 | 0.1% |
| 2025-01-08t13:41:34.086z | 11 | 0.1% |
| 2025-01-08t13:41:31.753z | 11 | 0.1% |
| 2025-01-08t13:41:34.469z | 11 | 0.1% |
| 2025-01-08t13:41:36.351z | 10 | 0.1% |
| Other values (4629) | 18753 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 62891 | |
| 0 | 62226 | |
| 3 | 46125 | |
| 2 | 45206 | |
| - | 37732 | |
| : | 37732 | |
| 4 | 27889 | |
| 5 | 27713 | |
| 8 | 24504 | 5.4% |
| T | 18866 | 4.2% |
| Other values (5) | 61844 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 320680 | |
| Other Punctuation | 56584 | 12.5% |
| Dash Punctuation | 37732 | 8.3% |
| Uppercase Letter | 37732 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 62891 | |
| 0 | 62226 | |
| 3 | 46125 | |
| 2 | 45206 | |
| 4 | 27889 | |
| 5 | 27713 | |
| 8 | 24504 | 7.6% |
| 6 | 9517 | 3.0% |
| 7 | 9009 | 2.8% |
| 9 | 5600 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 37732 | |
| . | 18852 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 18866 | |
| Z | 18866 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37732 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 414996 | |
| Latin | 37732 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 62891 | |
| 0 | 62226 | |
| 3 | 46125 | |
| 2 | 45206 | |
| - | 37732 | |
| : | 37732 | |
| 4 | 27889 | |
| 5 | 27713 | |
| 8 | 24504 | 5.9% |
| . | 18852 | 4.5% |
| Other values (3) | 24126 | 5.8% |
Latin
| Value | Count | Frequency (%) |
| T | 18866 | |
| Z | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 452728 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 62891 | |
| 0 | 62226 | |
| 3 | 46125 | |
| 2 | 45206 | |
| - | 37732 | |
| : | 37732 | |
| 4 | 27889 | |
| 5 | 27713 | |
| 8 | 24504 | 5.4% |
| T | 18866 | 4.2% |
| Other values (5) | 61844 |
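The lastInterpreted values above are typed as Text, and the length statistics (minimum 20, maximum 24) together with the character counts (18852 periods against 18866 "T" characters) suggest that a few timestamps omit fractional seconds. A minimal sketch, assuming only those two layouts occur and that the trailing "Z" means UTC; the function name `parse_last_interpreted` is hypothetical and the second sample string is illustrative, not taken from the dataset:

```python
from datetime import datetime, timezone

# Assumed layouts: with and without fractional seconds, both ending in "Z".
_FORMATS = ("%Y-%m-%dT%H:%M:%S.%fZ", "%Y-%m-%dT%H:%M:%SZ")

def parse_last_interpreted(text):
    """Parse a timestamp string into a timezone-aware UTC datetime."""
    for fmt in _FORMATS:
        try:
            return datetime.strptime(text, fmt).replace(tzinfo=timezone.utc)
        except ValueError:
            continue  # try the next assumed layout
    raise ValueError(f"unrecognized timestamp: {text!r}")

# First string is the 1st sample row; the second is a made-up short form.
with_millis = parse_last_interpreted("2025-01-08T13:41:37.071Z")
without_millis = parse_last_interpreted("2025-01-08T13:41:37Z")
```

Raising on an unrecognized layout, rather than returning a default, keeps unexpected formats visible instead of silently dropping them.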
elevation
Text
Missing 
| Distinct | 156 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 17391 |
| Missing (%) | 92.2% |
| Memory size | 147.5 KiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.444067797 |
| Min length | 3 |
Unique
| Unique | 39 |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | 61.0 |
|---|---|
| 2nd row | 61.0 |
| 3rd row | 638.0 |
| 4th row | 638.0 |
| 5th row | 1143.0 |
| Value | Count | Frequency (%) |
| 1829.0 | 124 | 8.4% |
| 61.0 | 104 | 7.1% |
| 2896.0 | 60 | 4.1% |
| 700.0 | 59 | 4.0% |
| 2134.0 | 59 | 4.0% |
| 638.0 | 56 | 3.8% |
| 1000.0 | 53 | 3.6% |
| 500.0 | 42 | 2.8% |
| 1402.0 | 29 | 2.0% |
| 1280.0 | 29 | 2.0% |
| Other values (146) | 860 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2400 | |
| . | 1475 | |
| 1 | 934 | 11.6% |
| 2 | 633 | 7.9% |
| 8 | 506 | 6.3% |
| 6 | 445 | 5.5% |
| 9 | 373 | 4.6% |
| 3 | 369 | 4.6% |
| 7 | 309 | 3.8% |
| 5 | 294 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6555 | |
| Other Punctuation | 1475 | 18.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2400 | |
| 1 | 934 | 14.2% |
| 2 | 633 | 9.7% |
| 8 | 506 | 7.7% |
| 6 | 445 | 6.8% |
| 9 | 373 | 5.7% |
| 3 | 369 | 5.6% |
| 7 | 309 | 4.7% |
| 5 | 294 | 4.5% |
| 4 | 292 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1475 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8030 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2400 | |
| . | 1475 | |
| 1 | 934 | 11.6% |
| 2 | 633 | 7.9% |
| 8 | 506 | 6.3% |
| 6 | 445 | 5.5% |
| 9 | 373 | 4.6% |
| 3 | 369 | 4.6% |
| 7 | 309 | 3.8% |
| 5 | 294 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8030 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2400 | |
| . | 1475 | |
| 1 | 934 | 11.6% |
| 2 | 633 | 7.9% |
| 8 | 506 | 6.3% |
| 6 | 445 | 5.5% |
| 9 | 373 | 4.6% |
| 3 | 369 | 4.6% |
| 7 | 309 | 3.8% |
| 5 | 294 | 3.7% |
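Every variable in this report is typed as Text, including elevation, whose sample values (61.0, 638.0, 1143.0) look like decimal numbers. A minimal sketch, assuming the strings are meant to be read as floats and that missing entries should stay missing; the helper name `parse_numeric` is hypothetical and the input list is copied from the sample rows plus one illustrative missing value:

```python
def parse_numeric(values):
    """Convert text values to floats, keeping unparseable entries as None."""
    parsed = []
    for value in values:
        try:
            parsed.append(float(value))
        except (TypeError, ValueError):
            # None raises TypeError; empty or non-numeric text raises ValueError.
            parsed.append(None)
    return parsed

# The five sample rows above, plus one missing entry for illustration.
sample = ["61.0", "61.0", "638.0", "638.0", "1143.0", None]
elevations = parse_numeric(sample)
```

With 92.2% of the column missing, keeping `None` in place preserves row alignment with the other columns.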
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 18082 |
| Missing (%) | 95.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 3 |
| Mean length | 3.005102041 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
| Value | Count | Frequency (%) |
| 0.0 | 782 | |
| 152.5 | 2 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1564 | |
| . | 784 | |
| 5 | 4 | 0.2% |
| 1 | 2 | 0.1% |
| 2 | 2 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1572 | |
| Other Punctuation | 784 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1564 | |
| 5 | 4 | 0.3% |
| 1 | 2 | 0.1% |
| 2 | 2 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 784 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2356 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1564 | |
| . | 784 | |
| 5 | 4 | 0.2% |
| 1 | 2 | 0.1% |
| 2 | 2 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2356 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1564 | |
| . | 784 | |
| 5 | 4 | 0.2% |
| 1 | 2 | 0.1% |
| 2 | 2 | 0.1% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | 17.9% |
| Missing | 18788 |
| Missing (%) | 99.6% |
| Memory size | 147.5 KiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 15.24358974 |
| Min length | 3 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | 9.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 1678.9213806293344 |
| 3rd row | 0.0 |
| 4th row | 3317.269457389723 |
| 5th row | 4308.557461717021 |
| Value | Count | Frequency (%) |
| 4308.557461717021 | 30 | |
| 1132.2847034170802 | 13 | |
| 0.0 | 12 | 15.4% |
| 2569.2685781328946 | 9 | 11.5% |
| 3322.3754451523614 | 3 | 3.8% |
| 2427.113575024377 | 2 | 2.6% |
| 4700.828968112741 | 2 | 2.6% |
| 1678.9213806293344 | 1 | 1.3% |
| 3317.269457389723 | 1 | 1.3% |
| 2524.2049532876945 | 1 | 1.3% |
| Other values (4) | 4 | 5.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 7 | 156 | |
| 0 | 138 | |
| 2 | 133 | |
| 4 | 120 | |
| 5 | 100 | |
| 3 | 98 | |
| 8 | 98 | |
| . | 78 | |
| 6 | 71 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1111 | |
| Other Punctuation | 78 | 6.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 7 | 156 | |
| 0 | 138 | |
| 2 | 133 | |
| 4 | 120 | |
| 5 | 100 | |
| 3 | 98 | |
| 8 | 98 | |
| 6 | 71 | |
| 9 | 31 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 78 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1189 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 7 | 156 | |
| 0 | 138 | |
| 2 | 133 | |
| 4 | 120 | |
| 5 | 100 | |
| 3 | 98 | |
| 8 | 98 | |
| . | 78 | |
| 6 | 71 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1189 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 166 | |
| 7 | 156 | |
| 0 | 138 | |
| 2 | 133 | |
| 4 | 120 | |
| 5 | 100 | |
| 3 | 98 | |
| 8 | 98 | |
| . | 78 | |
| 6 | 71 |
issue
Text
| Distinct | 46 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 178 |
|---|---|
| Median length | 72 |
| Mean length | 79.85301601 |
| Min length | 72 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY |
|---|---|
| 2nd row | TAXON_MATCH_HIGHERRANK;OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY |
| 3rd row | TAXON_MATCH_HIGHERRANK;OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY |
| 4th row | TAXON_MATCH_HIGHERRANK;OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY |
| 5th row | TAXON_MATCH_HIGHERRANK;OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 12822 | |
| taxon_match_higherrank;occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 3130 | 16.6% |
| coordinate_rounded;occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 1102 | 5.8% |
| continent_coordinate_mismatch;occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 347 | 1.8% |
| recorded_date_mismatch;occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 299 | 1.6% |
| coordinate_reprojected;occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 181 | 1.0% |
| taxon_match_none;occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 152 | 0.8% |
| taxon_match_fuzzy;occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 139 | 0.7% |
| coordinate_rounded;taxon_match_higherrank;occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 130 | 0.7% |
| continent_derived_from_coordinates;occurrence_status_inferred_from_individual_count;institution_match_fuzzy | 80 | 0.4% |
| Other values (36) | 484 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 144679 | 9.6% |
| _ | 143839 | 9.5% |
| I | 139261 | 9.2% |
| N | 125769 | 8.3% |
| U | 115218 | 7.6% |
| R | 106323 | 7.1% |
| C | 102478 | 6.8% |
| O | 86554 | 5.7% |
| E | 85787 | 5.7% |
| A | 71280 | 4.7% |
| Other values (18) | 385319 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1336919 | |
| Connector Punctuation | 143839 | 9.5% |
| Other Punctuation | 25381 | 1.7% |
| Decimal Number | 368 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 144679 | |
| I | 139261 | |
| N | 125769 | |
| U | 115218 | 8.6% |
| R | 106323 | 8.0% |
| C | 102478 | 7.7% |
| O | 86554 | 6.5% |
| E | 85787 | 6.4% |
| A | 71280 | 5.3% |
| D | 63686 | 4.8% |
| Other values (14) | 295884 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 184 | |
| 4 | 184 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 143839 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 25381 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1336919 | |
| Common | 169588 | 11.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 144679 | |
| I | 139261 | |
| N | 125769 | |
| U | 115218 | 8.6% |
| R | 106323 | 8.0% |
| C | 102478 | 7.7% |
| O | 86554 | 6.5% |
| E | 85787 | 6.4% |
| A | 71280 | 5.3% |
| D | 63686 | 4.8% |
| Other values (14) | 295884 |
Common
| Value | Count | Frequency (%) |
| _ | 143839 | |
| ; | 25381 | 15.0% |
| 8 | 184 | 0.1% |
| 4 | 184 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1506507 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 144679 | 9.6% |
| _ | 143839 | 9.5% |
| I | 139261 | 9.2% |
| N | 125769 | 8.3% |
| U | 115218 | 7.6% |
| R | 106323 | 7.1% |
| C | 102478 | 6.8% |
| O | 86554 | 5.7% |
| E | 85787 | 5.7% |
| A | 71280 | 4.7% |
| Other values (18) | 385319 |
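The issue values above are semicolon-delimited lists of flags, so the whole-string counts do not show how often an individual flag occurs across rows. A minimal sketch, assuming the flags are separated only by `;`, of counting each flag on its own; the function name `count_issue_flags` is hypothetical and the two input strings are copied from the sample rows rather than the full dataset:

```python
from collections import Counter

def count_issue_flags(values):
    """Count individual flags in semicolon-delimited issue strings."""
    counts = Counter()
    for value in values:
        if not value:
            continue  # skip missing or empty entries
        counts.update(flag.strip() for flag in value.split(";") if flag.strip())
    return counts

# Two of the sample rows shown above, not the full column.
sample = [
    "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY",
    "TAXON_MATCH_HIGHERRANK;OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;INSTITUTION_MATCH_FUZZY",
]
flag_counts = count_issue_flags(sample)
```

Counting per flag makes it easier to see which checks apply to nearly every record and which are rare.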
mediaType
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 18411 |
| Missing (%) | 97.6% |
| Memory size | 147.5 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 455 |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 910 | |
| S | 455 | |
| t | 455 | |
| i | 455 | |
| I | 455 | |
| m | 455 | |
| a | 455 | |
| g | 455 | |
| e | 455 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3640 | |
| Uppercase Letter | 910 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 910 | |
| t | 455 | |
| i | 455 | |
| m | 455 | |
| a | 455 | |
| g | 455 | |
| e | 455 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 455 | |
| I | 455 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4550 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 910 | |
| S | 455 | |
| t | 455 | |
| i | 455 | |
| I | 455 | |
| m | 455 | |
| a | 455 | |
| g | 455 | |
| e | 455 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4550 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 910 | |
| S | 455 | |
| t | 455 | |
| i | 455 | |
| I | 455 | |
| m | 455 | |
| a | 455 | |
| g | 455 | |
| e | 455 |
hasCoordinate
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.293808969 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| true | 13323 | 70.6% |
| false | 5543 | 29.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 18866 | |
| t | 13323 | |
| r | 13323 | |
| u | 13323 | |
| f | 5543 | 6.8% |
| a | 5543 | 6.8% |
| l | 5543 | 6.8% |
| s | 5543 | 6.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 81007 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 18866 | |
| t | 13323 | |
| r | 13323 | |
| u | 13323 | |
| f | 5543 | 6.8% |
| a | 5543 | 6.8% |
| l | 5543 | 6.8% |
| s | 5543 | 6.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 81007 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 18866 | |
| t | 13323 | |
| r | 13323 | |
| u | 13323 | |
| f | 5543 | 6.8% |
| a | 5543 | 6.8% |
| l | 5543 | 6.8% |
| s | 5543 | 6.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 81007 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 18866 | |
| t | 13323 | |
| r | 13323 | |
| u | 13323 | |
| f | 5543 | 6.8% |
| a | 5543 | 6.8% |
| l | 5543 | 6.8% |
| s | 5543 | 6.8% |
hasGeospatialIssues
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.997985795 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 18828 | 99.8% |
| true | 38 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 18866 | |
| f | 18828 | |
| a | 18828 | |
| l | 18828 | |
| s | 18828 | |
| t | 38 | < 0.1% |
| r | 38 | < 0.1% |
| u | 38 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 94292 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 18866 | |
| f | 18828 | |
| a | 18828 | |
| l | 18828 | |
| s | 18828 | |
| t | 38 | < 0.1% |
| r | 38 | < 0.1% |
| u | 38 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 94292 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 18866 | |
| f | 18828 | |
| a | 18828 | |
| l | 18828 | |
| s | 18828 | |
| t | 38 | < 0.1% |
| r | 38 | < 0.1% |
| u | 38 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 94292 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 18866 | |
| f | 18828 | |
| a | 18828 | |
| l | 18828 | |
| s | 18828 | |
| t | 38 | < 0.1% |
| r | 38 | < 0.1% |
| u | 38 | < 0.1% |
taxonKey
Text
| Distinct | 1854 |
|---|---|
| Distinct (%) | 9.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.769055444 |
| Min length | 1 |
Unique
| Unique | 618 |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | 4263596 |
|---|---|
| 2nd row | 2438019 |
| 3rd row | 2438019 |
| 4th row | 2438019 |
| 5th row | 2438019 |
| Value | Count | Frequency (%) |
| 6163288 | 835 | 4.4% |
| 2438019 | 774 | 4.1% |
| 7059215 | 728 | 3.9% |
| 2439137 | 691 | 3.7% |
| 2437967 | 459 | 2.4% |
| 2439461 | 365 | 1.9% |
| 5219858 | 364 | 1.9% |
| 7194100 | 275 | 1.5% |
| 6163538 | 267 | 1.4% |
| 359 | 248 | 1.3% |
| Other values (1844) | 13860 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 18522 | |
| 4 | 16888 | |
| 3 | 15746 | |
| 1 | 14417 | |
| 9 | 12202 | |
| 6 | 11986 | |
| 7 | 10030 | |
| 5 | 9755 | |
| 8 | 9707 | |
| 0 | 8452 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 127705 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 18522 | |
| 4 | 16888 | |
| 3 | 15746 | |
| 1 | 14417 | |
| 9 | 12202 | |
| 6 | 11986 | |
| 7 | 10030 | |
| 5 | 9755 | |
| 8 | 9707 | |
| 0 | 8452 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 127705 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 18522 | |
| 4 | 16888 | |
| 3 | 15746 | |
| 1 | 14417 | |
| 9 | 12202 | |
| 6 | 11986 | |
| 7 | 10030 | |
| 5 | 9755 | |
| 8 | 9707 | |
| 0 | 8452 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 127705 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 18522 | |
| 4 | 16888 | |
| 3 | 15746 | |
| 1 | 14417 | |
| 9 | 12202 | |
| 6 | 11986 | |
| 7 | 10030 | |
| 5 | 9755 | |
| 8 | 9707 | |
| 0 | 8452 |
acceptedTaxonKey
Text
| Distinct | 1774 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 152 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.811424602 |
| Min length | 2 |
Unique
| Unique | 580 |
|---|---|
| Unique (%) | 3.1% |
Sample
| 1st row | 4263596 |
|---|---|
| 2nd row | 2438019 |
| 3rd row | 2438019 |
| 4th row | 2438019 |
| 5th row | 2438019 |
| Value | Count | Frequency (%) |
| 6163288 | 835 | 4.5% |
| 2438019 | 775 | 4.1% |
| 7059215 | 728 | 3.9% |
| 5706760 | 708 | 3.8% |
| 5219858 | 631 | 3.4% |
| 2437967 | 528 | 2.8% |
| 2439461 | 365 | 2.0% |
| 7194100 | 275 | 1.5% |
| 2438655 | 259 | 1.4% |
| 359 | 248 | 1.3% |
| Other values (1764) | 13362 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 18721 | |
| 4 | 16423 | |
| 3 | 13842 | |
| 1 | 13333 | |
| 6 | 13306 | |
| 9 | 11427 | |
| 7 | 10399 | |
| 5 | 10313 | |
| 8 | 10107 | |
| 0 | 9598 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 127469 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 18721 | |
| 4 | 16423 | |
| 3 | 13842 | |
| 1 | 13333 | |
| 6 | 13306 | |
| 9 | 11427 | |
| 7 | 10399 | |
| 5 | 10313 | |
| 8 | 10107 | |
| 0 | 9598 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 127469 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 18721 | |
| 4 | 16423 | |
| 3 | 13842 | |
| 1 | 13333 | |
| 6 | 13306 | |
| 9 | 11427 | |
| 7 | 10399 | |
| 5 | 10313 | |
| 8 | 10107 | |
| 0 | 9598 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 127469 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 18721 | |
| 4 | 16423 | |
| 3 | 13842 | |
| 1 | 13333 | |
| 6 | 13306 | |
| 9 | 11427 | |
| 7 | 10399 | |
| 5 | 10313 | |
| 8 | 10107 | |
| 0 | 9598 |
kingdomKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 18714 | 99.2% |
| 0 | 152 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 18714 | |
| 0 | 152 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18866 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 18714 | |
| 0 | 152 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 18866 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 18714 | |
| 0 | 152 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18866 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 18714 | |
| 0 | 152 | 0.8% |
phylumKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 152 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 44 |
|---|---|
| 2nd row | 44 |
| 3rd row | 44 |
| 4th row | 44 |
| 5th row | 44 |
| Value | Count | Frequency (%) |
| 44 | 18714 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 37428 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 37428 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 37428 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 37428 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 37428 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37428 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 37428 |
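The 152 rows with kingdomKey `0` match the 152 missing values in both phylumKey and acceptedTaxonKey, which suggests these are records without a usable taxonomic match. Reading key `0` as an "incertae sedis" placeholder is an assumption here, not something the report states. A minimal sketch for checking the overlap on the raw download, using a hypothetical three-row excerpt in tab-separated form:

```python
import csv
import io

# Hypothetical excerpt; empty fields stand for missing values.
SAMPLE = (
    "kingdomKey\tphylumKey\tacceptedTaxonKey\n"
    "1\t44\t2438019\n"
    "0\t\t\n"
    "1\t44\t4263596\n"
)

def unmatched_rows(handle):
    """Return the rows whose kingdomKey is the placeholder value "0"."""
    reader = csv.DictReader(handle, delimiter="\t")
    return [row for row in reader if row["kingdomKey"] == "0"]

rows = unmatched_rows(io.StringIO(SAMPLE))

# True when every placeholder row also lacks the lower-rank keys.
all_blank = all(
    row["phylumKey"] == "" and row["acceptedTaxonKey"] == "" for row in rows
)
print(len(rows), all_blank)
```

On the real file, `io.StringIO(SAMPLE)` would be replaced by an open file handle. The keys are compared as strings because the report types every column as Text.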
classKey
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 154 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 359 |
|---|---|
| 2nd row | 359 |
| 3rd row | 359 |
| 4th row | 359 |
| 5th row | 359 |
| Value | Count | Frequency (%) |
| 359 | 18712 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 18712 | |
| 5 | 18712 | |
| 9 | 18712 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 56136 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 18712 | |
| 5 | 18712 | |
| 9 | 18712 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 56136 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 18712 | |
| 5 | 18712 | |
| 9 | 18712 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56136 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 18712 | |
| 5 | 18712 | |
| 9 | 18712 |
orderKey
Text
Missing 
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 406 |
| Missing (%) | 2.2% |
| Memory size | 147.5 KiB |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.474702059 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1459 |
|---|---|
| 2nd row | 1459 |
| 3rd row | 1459 |
| 4th row | 1459 |
| 5th row | 1459 |
| Value | Count | Frequency (%) |
| 1459 | 8426 | 45.6% |
| 803 | 2476 | 13.4% |
| 732 | 2371 | 12.8% |
| 731 | 1529 | 8.3% |
| 734 | 1102 | 6.0% |
| 798 | 953 | 5.2% |
| 785 | 348 | 1.9% |
| 1452 | 248 | 1.3% |
| 783 | 213 | 1.2% |
| 795 | 157 | 0.9% |
| Other values (17) | 637 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 10349 | |
| 4 | 9983 | |
| 9 | 9799 | |
| 5 | 9346 | |
| 3 | 8072 | |
| 7 | 7095 | |
| 8 | 4213 | |
| 2 | 2730 | 4.3% |
| 0 | 2511 | 3.9% |
| 6 | 45 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 64143 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 10349 | |
| 4 | 9983 | |
| 9 | 9799 | |
| 5 | 9346 | |
| 3 | 8072 | |
| 7 | 7095 | |
| 8 | 4213 | |
| 2 | 2730 | 4.3% |
| 0 | 2511 | 3.9% |
| 6 | 45 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 64143 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 10349 | |
| 4 | 9983 | |
| 9 | 9799 | |
| 5 | 9346 | |
| 3 | 8072 | |
| 7 | 7095 | |
| 8 | 4213 | |
| 2 | 2730 | 4.3% |
| 0 | 2511 | 3.9% |
| 6 | 45 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 64143 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 10349 | |
| 4 | 9983 | |
| 9 | 9799 | |
| 5 | 9346 | |
| 3 | 8072 | |
| 7 | 7095 | |
| 8 | 4213 | |
| 2 | 2730 | 4.3% |
| 0 | 2511 | 3.9% |
| 6 | 45 | 0.1% |
familyKey
Text
Missing 
| Distinct | 134 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 684 |
| Missing (%) | 3.6% |
| Memory size | 147.5 KiB |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.745132549 |
| Min length | 4 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 9456 |
|---|---|
| 2nd row | 3240723 |
| 3rd row | 3240723 |
| 4th row | 3240723 |
| 5th row | 3240723 |
| Value | Count | Frequency (%) |
| 3240723 | 4133 | 22.7% |
| 5534 | 2286 | 12.6% |
| 9456 | 1673 | 9.2% |
| 5510 | 1068 | 5.9% |
| 9614 | 837 | 4.6% |
| 9701 | 662 | 3.6% |
| 9435 | 459 | 2.5% |
| 5307 | 440 | 2.4% |
| 9622 | 421 | 2.3% |
| 9368 | 405 | 2.2% |
| Other values (124) | 5798 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 15416 | |
| 5 | 13473 | |
| 4 | 11897 | |
| 2 | 10515 | |
| 9 | 9211 | |
| 0 | 7550 | |
| 7 | 7183 | |
| 6 | 5300 | 6.1% |
| 1 | 4201 | 4.9% |
| 8 | 1530 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 86276 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 15416 | |
| 5 | 13473 | |
| 4 | 11897 | |
| 2 | 10515 | |
| 9 | 9211 | |
| 0 | 7550 | |
| 7 | 7183 | |
| 6 | 5300 | 6.1% |
| 1 | 4201 | 4.9% |
| 8 | 1530 | 1.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 86276 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 15416 | |
| 5 | 13473 | |
| 4 | 11897 | |
| 2 | 10515 | |
| 9 | 9211 | |
| 0 | 7550 | |
| 7 | 7183 | |
| 6 | 5300 | 6.1% |
| 1 | 4201 | 4.9% |
| 8 | 1530 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 86276 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 15416 | |
| 5 | 13473 | |
| 4 | 11897 | |
| 2 | 10515 | |
| 9 | 9211 | |
| 0 | 7550 | |
| 7 | 7183 | |
| 6 | 5300 | 6.1% |
| 1 | 4201 | 4.9% |
| 8 | 1530 | 1.8% |
genusKey
Text
Missing 
| Distinct | 612 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 1248 |
| Missing (%) | 6.6% |
| Memory size | 147.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.000908162 |
| Min length | 7 |
Unique
| Unique | 99 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 2437422 |
|---|---|
| 2nd row | 2437961 |
| 3rd row | 2437961 |
| 4th row | 2437961 |
| 5th row | 2437961 |
| Value | Count | Frequency (%) |
| 2437961 | 1837 | 10.4% |
| 2435935 | 1183 | 6.7% |
| 2435858 | 976 | 5.5% |
| 2438724 | 742 | 4.2% |
| 5219857 | 631 | 3.6% |
| 2438591 | 430 | 2.4% |
| 2437422 | 398 | 2.3% |
| 2439460 | 365 | 2.1% |
| 5219142 | 345 | 2.0% |
| 2433592 | 329 | 1.9% |
| Other values (602) | 10382 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 24012 | |
| 4 | 21930 | |
| 3 | 20143 | |
| 5 | 11636 | |
| 9 | 10645 | |
| 7 | 8680 | 7.0% |
| 8 | 8149 | 6.6% |
| 1 | 7388 | 6.0% |
| 6 | 6542 | 5.3% |
| 0 | 4217 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 123342 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 24012 | |
| 4 | 21930 | |
| 3 | 20143 | |
| 5 | 11636 | |
| 9 | 10645 | |
| 7 | 8680 | 7.0% |
| 8 | 8149 | 6.6% |
| 1 | 7388 | 6.0% |
| 6 | 6542 | 5.3% |
| 0 | 4217 | 3.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 123342 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 24012 | |
| 4 | 21930 | |
| 3 | 20143 | |
| 5 | 11636 | |
| 9 | 10645 | |
| 7 | 8680 | 7.0% |
| 8 | 8149 | 6.6% |
| 1 | 7388 | 6.0% |
| 6 | 6542 | 5.3% |
| 0 | 4217 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 123342 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 24012 | |
| 4 | 21930 | |
| 3 | 20143 | |
| 5 | 11636 | |
| 9 | 10645 | |
| 7 | 8680 | 7.0% |
| 8 | 8149 | 6.6% |
| 1 | 7388 | 6.0% |
| 6 | 6542 | 5.3% |
| 0 | 4217 | 3.4% |
speciesKey
Text
Missing 
| Distinct | 1113 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 2554 |
| Missing (%) | 13.5% |
| Memory size | 147.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.003126533 |
| Min length | 7 |
Unique
| Unique | 305 |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | 2437438 |
|---|---|
| 2nd row | 2438019 |
| 3rd row | 2438019 |
| 4th row | 2438019 |
| 5th row | 2438019 |
| Value | Count | Frequency (%) |
| 2435862 | 975 | 6.0% |
| 2438019 | 775 | 4.8% |
| 2435964 | 739 | 4.5% |
| 5706760 | 708 | 4.3% |
| 2437967 | 683 | 4.2% |
| 5219858 | 631 | 3.9% |
| 2439461 | 365 | 2.2% |
| 5218786 | 327 | 2.0% |
| 2437282 | 292 | 1.8% |
| 2435947 | 281 | 1.7% |
| Other values (1103) | 10536 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 20244 | |
| 4 | 17780 | |
| 3 | 14803 | |
| 5 | 10031 | |
| 6 | 9653 | |
| 8 | 9437 | |
| 9 | 9094 | |
| 7 | 8673 | |
| 1 | 7654 | 6.7% |
| 0 | 6866 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 114235 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 20244 | |
| 4 | 17780 | |
| 3 | 14803 | |
| 5 | 10031 | |
| 6 | 9653 | |
| 8 | 9437 | |
| 9 | 9094 | |
| 7 | 8673 | |
| 1 | 7654 | 6.7% |
| 0 | 6866 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 114235 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 20244 | |
| 4 | 17780 | |
| 3 | 14803 | |
| 5 | 10031 | |
| 6 | 9653 | |
| 8 | 9437 | |
| 9 | 9094 | |
| 7 | 8673 | |
| 1 | 7654 | 6.7% |
| 0 | 6866 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 114235 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 20244 | |
| 4 | 17780 | |
| 3 | 14803 | |
| 5 | 10031 | |
| 6 | 9653 | |
| 8 | 9437 | |
| 9 | 9094 | |
| 7 | 8673 | |
| 1 | 7654 | 6.7% |
| 0 | 6866 | 6.0% |
species
Text
Missing 
| Distinct | 1112 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 2554 |
| Missing (%) | 13.5% |
| Memory size | 147.5 KiB |
Length
| Max length | 29 |
|---|---|
| Median length | 25 |
| Mean length | 17.32203286 |
| Min length | 9 |
Unique
Unique
| Unique | 305 |
| Unique (%) | 1.9% |
Sample
| 1st row | Tamias striatus |
|---|---|
| 2nd row | Peromyscus leucopus |
| 3rd row | Peromyscus leucopus |
| 4th row | Peromyscus leucopus |
| 5th row | Peromyscus leucopus |
| Value | Count | Frequency (%) |
| peromyscus | 1633 | 5.0% |
| sorex | 1176 | 3.6% |
| brevicauda | 986 | 3.0% |
| blarina | 976 | 3.0% |
| leucopus | 775 | 2.4% |
| cinereus | 747 | 2.3% |
| myodes | 742 | 2.3% |
| gapperi | 708 | 2.2% |
| maniculatus | 683 | 2.1% |
| ondatra | 631 | 1.9% |
| Other values (1485) | 23567 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 29016 | 10.3% |
| a | 24276 | 8.6% |
| u | 23247 | 8.2% |
| i | 22681 | 8.0% |
| e | 18844 | 6.7% |
| r | 18410 | 6.5% |
| o | 17053 | 6.0% |
| (space) | 16312 | 5.8% |
| c | 14413 | 5.1% |
| n | 13336 | 4.7% |
| Other values (40) | 84969 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 249933 | |
| Space Separator | 16312 | 5.8% |
| Uppercase Letter | 16312 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 29016 | |
| a | 24276 | |
| u | 23247 | |
| i | 22681 | 9.1% |
| e | 18844 | 7.5% |
| r | 18410 | 7.4% |
| o | 17053 | 6.8% |
| c | 14413 | 5.8% |
| n | 13336 | 5.3% |
| l | 10927 | 4.4% |
| Other values (16) | 57730 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 2710 | |
| M | 2200 | |
| S | 1866 | |
| C | 1481 | |
| O | 1191 | |
| B | 1187 | |
| T | 1149 | |
| N | 619 | 3.8% |
| A | 577 | 3.5% |
| D | 533 | 3.3% |
| Other values (13) | 2799 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 16312 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 266245 | |
| Common | 16312 | 5.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 29016 | 10.9% |
| a | 24276 | 9.1% |
| u | 23247 | 8.7% |
| i | 22681 | 8.5% |
| e | 18844 | 7.1% |
| r | 18410 | 6.9% |
| o | 17053 | 6.4% |
| c | 14413 | 5.4% |
| n | 13336 | 5.0% |
| l | 10927 | 4.1% |
| Other values (39) | 74042 |
Common
| Value | Count | Frequency (%) |
| (space) | 16312 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 282557 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 29016 | 10.3% |
| a | 24276 | 8.6% |
| u | 23247 | 8.2% |
| i | 22681 | 8.0% |
| e | 18844 | 6.7% |
| r | 18410 | 6.5% |
| o | 17053 | 6.0% |
| (space) | 16312 | 5.8% |
| c | 14413 | 5.1% |
| n | 13336 | 4.7% |
| Other values (40) | 84969 |
acceptedScientificName
Text
| Distinct | 1774 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 152 |
| Missing (%) | 0.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 110 |
|---|---|
| Median length | 58 |
| Mean length | 31.81564604 |
| Min length | 6 |
Unique
| Unique | 580 |
|---|---|
| Unique (%) | 3.1% |
Sample
| 1st row | Tamias striatus fisheri A.H.Howell, 1925 |
|---|---|
| 2nd row | Peromyscus leucopus (Rafinesque, 1818) |
| 3rd row | Peromyscus leucopus (Rafinesque, 1818) |
| 4th row | Peromyscus leucopus (Rafinesque, 1818) |
| 5th row | Peromyscus leucopus (Rafinesque, 1818) |
| Value | Count | Frequency (%) |
| linnaeus | 2733 | 3.8% |
| 1758 | 1977 | 2.8% |
| peromyscus | 1837 | 2.6% |
| 1830 | 1587 | 2.2% |
| cinereus | 1487 | 2.1% |
| sorex | 1183 | 1.6% |
| brevicauda | 1124 | 1.6% |
| blarina | 976 | 1.4% |
| talpoides | 867 | 1.2% |
| 1766 | 860 | 1.2% |
| Other values (2410) | 57206 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 53123 | 8.9% |
| s | 43421 | 7.3% |
| a | 41483 | 7.0% |
| e | 39164 | 6.6% |
| i | 38084 | 6.4% |
| u | 33788 | 5.7% |
| r | 31615 | 5.3% |
| n | 27100 | 4.6% |
| o | 25118 | 4.2% |
| l | 19859 | 3.3% |
| Other values (65) | 242643 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 410635 | |
| Decimal Number | 58980 | 9.9% |
| Space Separator | 53123 | 8.9% |
| Uppercase Letter | 36255 | 6.1% |
| Other Punctuation | 17087 | 2.9% |
| Open Punctuation | 9556 | 1.6% |
| Close Punctuation | 9556 | 1.6% |
| Dash Punctuation | 206 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 43421 | |
| a | 41483 | |
| e | 39164 | |
| i | 38084 | |
| u | 33788 | 8.2% |
| r | 31615 | 7.7% |
| n | 27100 | 6.6% |
| o | 25118 | 6.1% |
| l | 19859 | 4.8% |
| c | 19094 | 4.6% |
| Other values (20) | 91909 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 3933 | |
| L | 3788 | |
| P | 3612 | 10.0% |
| S | 3105 | 8.6% |
| G | 2600 | 7.2% |
| B | 2447 | 6.7% |
| C | 2194 | 6.1% |
| O | 1923 | 5.3% |
| T | 1717 | 4.7% |
| A | 1619 | 4.5% |
| Other values (17) | 9317 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 17754 | |
| 8 | 12985 | |
| 7 | 6043 | 10.2% |
| 5 | 4184 | 7.1% |
| 9 | 3974 | 6.7% |
| 0 | 3867 | 6.6% |
| 3 | 3367 | 5.7% |
| 6 | 2805 | 4.8% |
| 2 | 2295 | 3.9% |
| 4 | 1706 | 2.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 14845 | |
| . | 1934 | 11.3% |
| & | 303 | 1.8% |
| ' | 5 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 53123 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 9556 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 9556 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 206 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 446890 | |
| Common | 148508 | 24.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 43421 | 9.7% |
| a | 41483 | 9.3% |
| e | 39164 | 8.8% |
| i | 38084 | 8.5% |
| u | 33788 | 7.6% |
| r | 31615 | 7.1% |
| n | 27100 | 6.1% |
| o | 25118 | 5.6% |
| l | 19859 | 4.4% |
| c | 19094 | 4.3% |
| Other values (47) | 128164 |
Common
| Value | Count | Frequency (%) |
| (space) | 53123 | |
| 1 | 17754 | 12.0% |
| , | 14845 | 10.0% |
| 8 | 12985 | 8.7% |
| ( | 9556 | 6.4% |
| ) | 9556 | 6.4% |
| 7 | 6043 | 4.1% |
| 5 | 4184 | 2.8% |
| 9 | 3974 | 2.7% |
| 0 | 3867 | 2.6% |
| Other values (8) | 12621 | 8.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 594814 | |
| None | 584 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 53123 | 8.9% |
| s | 43421 | 7.3% |
| a | 41483 | 7.0% |
| e | 39164 | 6.6% |
| i | 38084 | 6.4% |
| u | 33788 | 5.7% |
| r | 31615 | 5.3% |
| n | 27100 | 4.6% |
| o | 25118 | 4.2% |
| l | 19859 | 3.3% |
| Other values (60) | 242059 |
None
| Value | Count | Frequency (%) |
| É | 384 | |
| ü | 133 | 22.8% |
| è | 28 | 4.8% |
| é | 25 | 4.3% |
| ö | 14 | 2.4% |
verbatimScientificName
Text
| Distinct | 2018 |
|---|---|
| Distinct (%) | 10.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 43 |
|---|---|
| Median length | 34 |
| Mean length | 22.09201739 |
| Min length | 3 |
Unique
| Unique | 703 |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | Tamias striatus fisheri |
|---|---|
| 2nd row | Peromyscus leucopus noveboracensis |
| 3rd row | Peromyscus leucopus noveboracensis |
| 4th row | Peromyscus leucopus noveboracensis |
| 5th row | Peromyscus leucopus noveboracensis |
| Value | Count | Frequency (%) |
| peromyscus | 1837 | 4.0% |
| cinereus | 1489 | 3.2% |
| sorex | 1193 | 2.6% |
| brevicauda | 1125 | 2.4% |
| blarina | 976 | 2.1% |
| zibethicus | 898 | 2.0% |
| talpoides | 868 | 1.9% |
| gapperi | 848 | 1.8% |
| maniculatus | 829 | 1.8% |
| leucopus | 782 | 1.7% |
| Other values (2070) | 35113 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 41623 | 10.0% |
| i | 36625 | 8.8% |
| a | 35093 | 8.4% |
| u | 30890 | 7.4% |
| e | 30381 | 7.3% |
| (space) | 27092 | 6.5% |
| r | 26522 | 6.4% |
| o | 25267 | 6.1% |
| n | 22452 | 5.4% |
| c | 20781 | 5.0% |
| Other values (43) | 120062 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 370969 | |
| Space Separator | 27092 | 6.5% |
| Uppercase Letter | 18716 | 4.5% |
| Other Punctuation | 9 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 41623 | |
| i | 36625 | |
| a | 35093 | |
| u | 30890 | 8.3% |
| e | 30381 | 8.2% |
| r | 26522 | 7.1% |
| o | 25267 | 6.8% |
| n | 22452 | 6.1% |
| c | 20781 | 5.6% |
| l | 16432 | 4.4% |
| Other values (16) | 84903 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3107 | |
| C | 2505 | |
| S | 1952 | |
| M | 1925 | |
| B | 1452 | |
| O | 1312 | |
| T | 1217 | 6.5% |
| N | 831 | 4.4% |
| L | 676 | 3.6% |
| A | 598 | 3.2% |
| Other values (13) | 3141 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7 | |
| ? | 2 | 22.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 27092 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 389685 | |
| Common | 27103 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 41623 | |
| i | 36625 | 9.4% |
| a | 35093 | 9.0% |
| u | 30890 | 7.9% |
| e | 30381 | 7.8% |
| r | 26522 | 6.8% |
| o | 25267 | 6.5% |
| n | 22452 | 5.8% |
| c | 20781 | 5.3% |
| l | 16432 | 4.2% |
| Other values (39) | 103619 |
Common
| Value | Count | Frequency (%) |
| (space) | 27092 | |
| . | 7 | < 0.1% |
| ? | 2 | < 0.1% |
| - | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 416788 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 41623 | 10.0% |
| i | 36625 | 8.8% |
| a | 35093 | 8.4% |
| u | 30890 | 7.4% |
| e | 30381 | 7.3% |
| (space) | 27092 | 6.5% |
| r | 26522 | 6.4% |
| o | 25267 | 6.1% |
| n | 22452 | 5.4% |
| c | 20781 | 5.0% |
| Other values (43) | 120062 |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 18866 | |
| M | 18866 | |
| L | 18866 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 56598 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 18866 | |
| M | 18866 | |
| L | 18866 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 56598 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 18866 | |
| M | 18866 | |
| L | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56598 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 18866 | |
| M | 18866 | |
| L | 18866 |
lastParsed
Text
| Distinct | 4639 |
|---|---|
| Distinct (%) | 24.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.9970317 |
| Min length | 20 |
Unique
| Unique | 411 |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | 2025-01-08T13:41:37.071Z |
|---|---|
| 2nd row | 2025-01-08T13:41:37.575Z |
| 3rd row | 2025-01-08T13:41:36.570Z |
| 4th row | 2025-01-08T13:41:33.336Z |
| 5th row | 2025-01-08T13:41:31.987Z |
| Value | Count | Frequency (%) |
| 2025-01-08t13:41:34.959z | 13 | 0.1% |
| 2025-01-08t13:41:37.272z | 12 | 0.1% |
| 2025-01-08t13:41:35.416z | 12 | 0.1% |
| 2025-01-08t13:41:37.511z | 11 | 0.1% |
| 2025-01-08t13:41:37.284z | 11 | 0.1% |
| 2025-01-08t13:41:35.688z | 11 | 0.1% |
| 2025-01-08t13:41:34.086z | 11 | 0.1% |
| 2025-01-08t13:41:31.753z | 11 | 0.1% |
| 2025-01-08t13:41:34.469z | 11 | 0.1% |
| 2025-01-08t13:41:36.351z | 10 | 0.1% |
| Other values (4629) | 18753 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 62891 | |
| 0 | 62226 | |
| 3 | 46125 | |
| 2 | 45206 | |
| - | 37732 | |
| : | 37732 | |
| 4 | 27889 | |
| 5 | 27713 | |
| 8 | 24504 | 5.4% |
| T | 18866 | 4.2% |
| Other values (5) | 61844 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 320680 | |
| Other Punctuation | 56584 | 12.5% |
| Dash Punctuation | 37732 | 8.3% |
| Uppercase Letter | 37732 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 62891 | |
| 0 | 62226 | |
| 3 | 46125 | |
| 2 | 45206 | |
| 4 | 27889 | |
| 5 | 27713 | |
| 8 | 24504 | 7.6% |
| 6 | 9517 | 3.0% |
| 7 | 9009 | 2.8% |
| 9 | 5600 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 37732 | |
| . | 18852 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 18866 | |
| Z | 18866 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37732 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 414996 | |
| Latin | 37732 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 62891 | |
| 0 | 62226 | |
| 3 | 46125 | |
| 2 | 45206 | |
| - | 37732 | |
| : | 37732 | |
| 4 | 27889 | |
| 5 | 27713 | |
| 8 | 24504 | 5.9% |
| . | 18852 | 4.5% |
| Other values (3) | 24126 | 5.8% |
Latin
| Value | Count | Frequency (%) |
| T | 18866 | |
| Z | 18866 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 452728 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 62891 | |
| 0 | 62226 | |
| 3 | 46125 | |
| 2 | 45206 | |
| - | 37732 | |
| : | 37732 | |
| 4 | 27889 | |
| 5 | 27713 | |
| 8 | 24504 | 5.4% |
| T | 18866 | 4.2% |
| Other values (5) | 61844 |
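The lastParsed lengths range from 20 to 24 characters, and the `.` count (18852) falls 14 short of the row count (18866). Both figures are consistent with 14 timestamps that omit the millisecond part, although the report does not show such a value directly. A minimal sketch of a parser that accepts both forms, using only the standard library (the function name and the 20-character example are hypothetical):

```python
from datetime import datetime, timezone

def parse_utc_timestamp(text: str) -> datetime:
    """Parse an ISO 8601 UTC timestamp with or without fractional seconds."""
    for fmt in ("%Y-%m-%dT%H:%M:%S.%fZ", "%Y-%m-%dT%H:%M:%SZ"):
        try:
            return datetime.strptime(text, fmt).replace(tzinfo=timezone.utc)
        except ValueError:
            continue
    raise ValueError(f"unrecognized timestamp: {text!r}")

with_millis = parse_utc_timestamp("2025-01-08T13:41:37.071Z")  # 24 characters, from the sample rows
without_millis = parse_utc_timestamp("2025-01-08T13:41:37Z")   # 20 characters, hypothetical value

# Mean length implied by 18852 long values and 14 short ones.
n_rows, n_with_fraction = 18866, 18852
implied_mean = (n_with_fraction * 24 + (n_rows - n_with_fraction) * 20) / n_rows
print(with_millis, without_millis, round(implied_mean, 7))
```

The implied mean agrees with the reported 23.9970317, which supports the reading that the short values simply lack the fractional part.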
lastCrawled
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2025-01-08T13:41:11.140Z |
|---|---|
| 2nd row | 2025-01-08T13:41:11.140Z |
| 3rd row | 2025-01-08T13:41:11.140Z |
| 4th row | 2025-01-08T13:41:11.140Z |
| 5th row | 2025-01-08T13:41:11.140Z |
| Value | Count | Frequency (%) |
| 2025-01-08t13:41:11.140z | 18866 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 113196 | 25.0% |
| 0 | 75464 | 16.7% |
| 2 | 37732 | 8.3% |
| - | 37732 | 8.3% |
| : | 37732 | 8.3% |
| 4 | 37732 | 8.3% |
| 5 | 18866 | 4.2% |
| 8 | 18866 | 4.2% |
| T | 18866 | 4.2% |
| 3 | 18866 | 4.2% |
| Other values (2) | 37732 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 320722 | 70.8% |
| Other Punctuation | 56598 | 12.5% |
| Dash Punctuation | 37732 | 8.3% |
| Uppercase Letter | 37732 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 113196 | 35.3% |
| 0 | 75464 | 23.5% |
| 2 | 37732 | 11.8% |
| 4 | 37732 | 11.8% |
| 5 | 18866 | 5.9% |
| 8 | 18866 | 5.9% |
| 3 | 18866 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 37732 | 66.7% |
| . | 18866 | 33.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 18866 | 50.0% |
| Z | 18866 | 50.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37732 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 415052 | 91.7% |
| Latin | 37732 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 113196 | 27.3% |
| 0 | 75464 | 18.2% |
| 2 | 37732 | 9.1% |
| - | 37732 | 9.1% |
| : | 37732 | 9.1% |
| 4 | 37732 | 9.1% |
| 5 | 18866 | 4.5% |
| 8 | 18866 | 4.5% |
| 3 | 18866 | 4.5% |
| . | 18866 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| T | 18866 | 50.0% |
| Z | 18866 | 50.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 452784 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 113196 | 25.0% |
| 0 | 75464 | 16.7% |
| 2 | 37732 | 8.3% |
| - | 37732 | 8.3% |
| : | 37732 | 8.3% |
| 4 | 37732 | 8.3% |
| 5 | 18866 | 4.2% |
| 8 | 18866 | 4.2% |
| T | 18866 | 4.2% |
| 3 | 18866 | 4.2% |
| Other values (2) | 37732 | 8.3% |
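The category counts reported for lastCrawled can be reproduced from the single constant value. The following is a minimal sketch, assuming the categories correspond to Unicode general categories as exposed by Python's standard `unicodedata` module; `category_counts` is a hypothetical helper written for illustration, not a function of the profiling tool.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Tally Unicode general categories over every character in `values`.

    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts


# The constant value shown in the Sample table for lastCrawled.
sample = "2025-01-08T13:41:11.140Z"
per_value = category_counts([sample])

# Nd = Decimal Number, Po = Other Punctuation,
# Pd = Dash Punctuation, Lu = Uppercase Letter.
n_rows = 18866  # number of observations in the dataset
totals = {cat: n * n_rows for cat, n in per_value.items()}
```

Because every row holds the same 24-character string, each per-value count multiplied by the number of observations gives the column totals listed under "Most occurring categories".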
repatriated
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3910 |
| Missing (%) | 20.7% |
| Memory size | 147.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.674511902 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 10088 | 67.5% |
| true | 4868 | 32.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 14956 | 21.4% |
| f | 10088 | 14.4% |
| a | 10088 | 14.4% |
| l | 10088 | 14.4% |
| s | 10088 | 14.4% |
| t | 4868 | 7.0% |
| r | 4868 | 7.0% |
| u | 4868 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 69912 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 14956 | 21.4% |
| f | 10088 | 14.4% |
| a | 10088 | 14.4% |
| l | 10088 | 14.4% |
| s | 10088 | 14.4% |
| t | 4868 | 7.0% |
| r | 4868 | 7.0% |
| u | 4868 | 7.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 69912 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 14956 | 21.4% |
| f | 10088 | 14.4% |
| a | 10088 | 14.4% |
| l | 10088 | 14.4% |
| s | 10088 | 14.4% |
| t | 4868 | 7.0% |
| r | 4868 | 7.0% |
| u | 4868 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 69912 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 14956 | 21.4% |
| f | 10088 | 14.4% |
| a | 10088 | 14.4% |
| l | 10088 | 14.4% |
| s | 10088 | 14.4% |
| t | 4868 | 7.0% |
| r | 4868 | 7.0% |
| u | 4868 | 7.0% |
isSequenced
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 18866 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 18866 | 20.0% |
| a | 18866 | 20.0% |
| l | 18866 | 20.0% |
| s | 18866 | 20.0% |
| e | 18866 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 94330 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 18866 | 20.0% |
| a | 18866 | 20.0% |
| l | 18866 | 20.0% |
| s | 18866 | 20.0% |
| e | 18866 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 94330 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 18866 | 20.0% |
| a | 18866 | 20.0% |
| l | 18866 | 20.0% |
| s | 18866 | 20.0% |
| e | 18866 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 94330 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 18866 | 20.0% |
| a | 18866 | 20.0% |
| l | 18866 | 20.0% |
| s | 18866 | 20.0% |
| e | 18866 | 20.0% |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3929 |
| Missing (%) | 20.8% |
| Memory size | 147.5 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.54395126 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 10775 | 72.1% |
| africa | 1928 | 12.9% |
| latin_america | 1203 | 8.1% |
| asia | 590 | 3.9% |
| europe | 303 | 2.0% |
| oceania | 136 | 0.9% |
| antarctica | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 30473 | 17.7% |
| R | 24986 | 14.5% |
| I | 15837 | 9.2% |
| C | 14046 | 8.1% |
| E | 12720 | 7.4% |
| N | 12116 | 7.0% |
| T | 11982 | 6.9% |
| _ | 11978 | 6.9% |
| M | 11978 | 6.9% |
| O | 11214 | 6.5% |
| Other values (6) | 15102 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 160454 | 93.1% |
| Connector Punctuation | 11978 | 6.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 30473 | 19.0% |
| R | 24986 | 15.6% |
| I | 15837 | 9.9% |
| C | 14046 | 8.8% |
| E | 12720 | 7.9% |
| N | 12116 | 7.6% |
| T | 11982 | 7.5% |
| M | 11978 | 7.5% |
| O | 11214 | 7.0% |
| H | 10775 | 6.7% |
| Other values (5) | 4327 | 2.7% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 11978 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 160454 | 93.1% |
| Common | 11978 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 30473 | 19.0% |
| R | 24986 | 15.6% |
| I | 15837 | 9.9% |
| C | 14046 | 8.8% |
| E | 12720 | 7.9% |
| N | 12116 | 7.6% |
| T | 11982 | 7.5% |
| M | 11978 | 7.5% |
| O | 11214 | 7.0% |
| H | 10775 | 6.7% |
| Other values (5) | 4327 | 2.7% |
Common
| Value | Count | Frequency (%) |
| _ | 11978 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 172432 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 30473 | 17.7% |
| R | 24986 | 14.5% |
| I | 15837 | 9.2% |
| C | 14046 | 8.1% |
| E | 12720 | 7.4% |
| N | 12116 | 7.0% |
| T | 11982 | 6.9% |
| _ | 11978 | 6.9% |
| M | 11978 | 6.9% |
| O | 11214 | 6.5% |
| Other values (6) | 15102 | 8.8% |
publishedByGbifRegion
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 147.5 KiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 18866 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 37732 | 15.4% |
| A | 37732 | 15.4% |
| N | 18866 | 7.7% |
| O | 18866 | 7.7% |
| T | 18866 | 7.7% |
| H | 18866 | 7.7% |
| _ | 18866 | 7.7% |
| M | 18866 | 7.7% |
| E | 18866 | 7.7% |
| I | 18866 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 226392 | 92.3% |
| Connector Punctuation | 18866 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 37732 | 16.7% |
| A | 37732 | 16.7% |
| N | 18866 | 8.3% |
| O | 18866 | 8.3% |
| T | 18866 | 8.3% |
| H | 18866 | 8.3% |
| M | 18866 | 8.3% |
| E | 18866 | 8.3% |
| I | 18866 | 8.3% |
| C | 18866 | 8.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 18866 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 226392 | 92.3% |
| Common | 18866 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 37732 | 16.7% |
| A | 37732 | 16.7% |
| N | 18866 | 8.3% |
| O | 18866 | 8.3% |
| T | 18866 | 8.3% |
| H | 18866 | 8.3% |
| M | 18866 | 8.3% |
| E | 18866 | 8.3% |
| I | 18866 | 8.3% |
| C | 18866 | 8.3% |
Common
| Value | Count | Frequency (%) |
| _ | 18866 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 245258 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 37732 | 15.4% |
| A | 37732 | 15.4% |
| N | 18866 | 7.7% |
| O | 18866 | 7.7% |
| T | 18866 | 7.7% |
| H | 18866 | 7.7% |
| _ | 18866 | 7.7% |
| M | 18866 | 7.7% |
| E | 18866 | 7.7% |
| I | 18866 | 7.7% |
level0Gid
Text
Missing 
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 5871 |
| Missing (%) | 31.1% |
| Memory size | 147.5 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | USA |
|---|---|
| 2nd row | USA |
| 3rd row | USA |
| 4th row | USA |
| 5th row | USA |
| Value | Count | Frequency (%) |
| usa | 9272 | 71.4% |
| can | 599 | 4.6% |
| ken | 564 | 4.3% |
| mex | 510 | 3.9% |
| egy | 334 | 2.6% |
| idn | 283 | 2.2% |
| ecu | 207 | 1.6% |
| grc | 118 | 0.9% |
| cmr | 80 | 0.6% |
| tza | 79 | 0.6% |
| Other values (84) | 949 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 10201 | 26.2% |
| U | 9653 | 24.8% |
| S | 9423 | 24.2% |
| N | 1727 | 4.4% |
| E | 1709 | 4.4% |
| C | 1189 | 3.0% |
| M | 764 | 2.0% |
| G | 624 | 1.6% |
| K | 592 | 1.5% |
| X | 510 | 1.3% |
| Other values (16) | 2593 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 38985 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10201 | 26.2% |
| U | 9653 | 24.8% |
| S | 9423 | 24.2% |
| N | 1727 | 4.4% |
| E | 1709 | 4.4% |
| C | 1189 | 3.0% |
| M | 764 | 2.0% |
| G | 624 | 1.6% |
| K | 592 | 1.5% |
| X | 510 | 1.3% |
| Other values (16) | 2593 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38985 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 10201 | 26.2% |
| U | 9653 | 24.8% |
| S | 9423 | 24.2% |
| N | 1727 | 4.4% |
| E | 1709 | 4.4% |
| C | 1189 | 3.0% |
| M | 764 | 2.0% |
| G | 624 | 1.6% |
| K | 592 | 1.5% |
| X | 510 | 1.3% |
| Other values (16) | 2593 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 38985 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 10201 | 26.2% |
| U | 9653 | 24.8% |
| S | 9423 | 24.2% |
| N | 1727 | 4.4% |
| E | 1709 | 4.4% |
| C | 1189 | 3.0% |
| M | 764 | 2.0% |
| G | 624 | 1.6% |
| K | 592 | 1.5% |
| X | 510 | 1.3% |
| Other values (16) | 2593 | 6.7% |
level0Name
Text
Missing 
| Distinct | 94 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 5871 |
| Missing (%) | 31.1% |
| Memory size | 147.5 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 13 |
| Mean length | 11.20176991 |
| Min length | 4 |
Unique
| Unique | 20 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | United States |
| 3rd row | United States |
| 4th row | United States |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 9285 | 41.2% |
| states | 9272 | 41.2% |
| canada | 599 | 2.7% |
| kenya | 564 | 2.5% |
| méxico | 510 | 2.3% |
| egypt | 334 | 1.5% |
| indonesia | 283 | 1.3% |
| ecuador | 207 | 0.9% |
| greece | 118 | 0.5% |
| cameroon | 80 | 0.4% |
| Other values (101) | 1268 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 28368 | 19.5% |
| e | 20349 | 14.0% |
| a | 13650 | 9.4% |
| n | 11735 | 8.1% |
| i | 10936 | 7.5% |
| d | 10558 | 7.3% |
| s | 9727 | 6.7% |
| (space) | 9525 | 6.5% |
| S | 9390 | 6.5% |
| U | 9288 | 6.4% |
| Other values (44) | 12041 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 113606 | 78.0% |
| Uppercase Letter | 22424 | 15.4% |
| Space Separator | 9525 | 6.5% |
| Other Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 28368 | 25.0% |
| e | 20349 | 17.9% |
| a | 13650 | 12.0% |
| n | 11735 | 10.3% |
| i | 10936 | 9.6% |
| d | 10558 | 9.3% |
| s | 9727 | 8.6% |
| o | 1481 | 1.3% |
| c | 1062 | 0.9% |
| y | 961 | 0.8% |
| Other values (19) | 4779 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 9390 | 41.9% |
| U | 9288 | 41.4% |
| C | 869 | 3.9% |
| K | 578 | 2.6% |
| M | 562 | 2.5% |
| E | 550 | 2.5% |
| I | 377 | 1.7% |
| G | 165 | 0.7% |
| T | 117 | 0.5% |
| B | 95 | 0.4% |
| Other values (11) | 433 | 1.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 9 | 75.0% |
| . | 2 | 16.7% |
| , | 1 | 8.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9525 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 136030 | 93.4% |
| Common | 9537 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 28368 | 20.9% |
| e | 20349 | 15.0% |
| a | 13650 | 10.0% |
| n | 11735 | 8.6% |
| i | 10936 | 8.0% |
| d | 10558 | 7.8% |
| s | 9727 | 7.2% |
| S | 9390 | 6.9% |
| U | 9288 | 6.8% |
| o | 1481 | 1.1% |
| Other values (40) | 10548 | 7.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 9525 | 99.9% |
| ' | 9 | 0.1% |
| . | 2 | < 0.1% |
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 145034 | 99.6% |
| None | 533 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 28368 | 19.6% |
| e | 20349 | 14.0% |
| a | 13650 | 9.4% |
| n | 11735 | 8.1% |
| i | 10936 | 7.5% |
| d | 10558 | 7.3% |
| s | 9727 | 6.7% |
| (space) | 9525 | 6.6% |
| S | 9390 | 6.5% |
| U | 9288 | 6.4% |
| Other values (41) | 11508 | 7.9% |
None
| Value | Count | Frequency (%) |
| é | 510 | 95.7% |
| ç | 14 | 2.6% |
| ô | 9 | 1.7% |
level1Gid
Text
Missing 
| Distinct | 341 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 5888 |
| Missing (%) | 31.2% |
| Memory size | 147.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.678301741 |
| Min length | 7 |
Unique
| Unique | 105 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | USA.7_1 |
|---|---|
| 2nd row | USA.7_1 |
| 3rd row | USA.39_1 |
| 4th row | USA.39_1 |
| 5th row | USA.39_1 |
| Value | Count | Frequency (%) |
| usa.30_1 | 2868 | 22.1% |
| usa.7_1 | 1293 | 10.0% |
| usa.24_1 | 551 | 4.2% |
| usa.6_1 | 447 | 3.4% |
| usa.33_1 | 446 | 3.4% |
| usa.3_1 | 436 | 3.4% |
| usa.50_1 | 423 | 3.3% |
| can.2_1 | 336 | 2.6% |
| usa.5_1 | 290 | 2.2% |
| idn.23_1 | 277 | 2.1% |
| Other values (331) | 5611 | 43.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 15054 | 15.1% |
| . | 12978 | 13.0% |
| _ | 12943 | 13.0% |
| A | 10197 | 10.2% |
| U | 9639 | 9.7% |
| S | 9423 | 9.5% |
| 3 | 5867 | 5.9% |
| 0 | 3709 | 3.7% |
| 2 | 2826 | 2.8% |
| 7 | 2156 | 2.2% |
| Other values (28) | 14857 | 14.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 39039 | 39.2% |
| Decimal Number | 34689 | 34.8% |
| Other Punctuation | 12978 | 13.0% |
| Connector Punctuation | 12943 | 13.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10197 | 26.1% |
| U | 9639 | 24.7% |
| S | 9423 | 24.1% |
| N | 1727 | 4.4% |
| E | 1709 | 4.4% |
| C | 1175 | 3.0% |
| M | 763 | 2.0% |
| G | 659 | 1.7% |
| K | 627 | 1.6% |
| X | 510 | 1.3% |
| Other values (16) | 2610 | 6.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 15054 | 43.4% |
| 3 | 5867 | 16.9% |
| 0 | 3709 | 10.7% |
| 2 | 2826 | 8.1% |
| 7 | 2156 | 6.2% |
| 4 | 1816 | 5.2% |
| 5 | 1294 | 3.7% |
| 6 | 970 | 2.8% |
| 8 | 551 | 1.6% |
| 9 | 446 | 1.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 12978 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 12943 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 60610 | 60.8% |
| Latin | 39039 | 39.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 10197 | 26.1% |
| U | 9639 | 24.7% |
| S | 9423 | 24.1% |
| N | 1727 | 4.4% |
| E | 1709 | 4.4% |
| C | 1175 | 3.0% |
| M | 763 | 2.0% |
| G | 659 | 1.7% |
| K | 627 | 1.6% |
| X | 510 | 1.3% |
| Other values (16) | 2610 | 6.7% |
Common
| Value | Count | Frequency (%) |
| 1 | 15054 | 24.8% |
| . | 12978 | 21.4% |
| _ | 12943 | 21.4% |
| 3 | 5867 | 9.7% |
| 0 | 3709 | 6.1% |
| 2 | 2826 | 4.7% |
| 7 | 2156 | 3.6% |
| 4 | 1816 | 3.0% |
| 5 | 1294 | 2.1% |
| 6 | 970 | 1.6% |
| Other values (2) | 997 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99649 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 15054 | 15.1% |
| . | 12978 | 13.0% |
| _ | 12943 | 13.0% |
| A | 10197 | 10.2% |
| U | 9639 | 9.7% |
| S | 9423 | 9.5% |
| 3 | 5867 | 5.9% |
| 0 | 3709 | 3.7% |
| 2 | 2826 | 2.8% |
| 7 | 2156 | 2.2% |
| Other values (28) | 14857 | 14.9% |
level1Name
Text
Missing 
| Distinct | 339 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 5888 |
| Missing (%) | 31.2% |
| Memory size | 147.5 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 9.578979812 |
| Min length | 3 |
Unique
| Unique | 105 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Connecticut |
|---|---|
| 2nd row | Connecticut |
| 3rd row | Pennsylvania |
| 4th row | Pennsylvania |
| 5th row | Pennsylvania |
| Value | Count | Frequency (%) |
| new | 3505 | 19.9% |
| hampshire | 2868 | 16.3% |
| connecticut | 1293 | 7.3% |
| minnesota | 551 | 3.1% |
| colorado | 447 | 2.5% |
| york | 446 | 2.5% |
| arizona | 436 | 2.5% |
| wisconsin | 423 | 2.4% |
| british | 336 | 1.9% |
| columbia | 336 | 1.9% |
| Other values (386) | 6996 | 39.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12131 | 9.8% |
| i | 10970 | 8.8% |
| e | 10741 | 8.6% |
| n | 8249 | 6.6% |
| o | 8034 | 6.5% |
| s | 7338 | 5.9% |
| r | 7053 | 5.7% |
| t | 5362 | 4.3% |
| (space) | 4659 | 3.7% |
| h | 4513 | 3.6% |
| Other values (54) | 45266 | 36.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 101897 | 82.0% |
| Uppercase Letter | 17647 | 14.2% |
| Space Separator | 4659 | 3.7% |
| Dash Punctuation | 102 | 0.1% |
| Other Punctuation | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12131 | 11.9% |
| i | 10970 | 10.8% |
| e | 10741 | 10.5% |
| n | 8249 | 8.1% |
| o | 8034 | 7.9% |
| s | 7338 | 7.2% |
| r | 7053 | 6.9% |
| t | 5362 | 5.3% |
| h | 4513 | 4.4% |
| w | 3956 | 3.9% |
| Other values (24) | 23550 | 23.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 4225 | 23.9% |
| H | 2947 | 16.7% |
| C | 2656 | 15.1% |
| M | 1539 | 8.7% |
| A | 1203 | 6.8% |
| W | 776 | 4.4% |
| P | 539 | 3.1% |
| Y | 450 | 2.6% |
| T | 407 | 2.3% |
| B | 400 | 2.3% |
| Other values (16) | 2505 | 14.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 6 | 54.5% |
| ' | 5 | 45.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4659 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 102 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 119544 | 96.2% |
| Common | 4772 | 3.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12131 | 10.1% |
| i | 10970 | 9.2% |
| e | 10741 | 9.0% |
| n | 8249 | 6.9% |
| o | 8034 | 6.7% |
| s | 7338 | 6.1% |
| r | 7053 | 5.9% |
| t | 5362 | 4.5% |
| h | 4513 | 3.8% |
| N | 4225 | 3.5% |
| Other values (50) | 40928 | 34.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 4659 | 97.6% |
| - | 102 | 2.1% |
| , | 6 | 0.1% |
| ' | 5 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 124049 | 99.8% |
| None | 267 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12131 | 9.8% |
| i | 10970 | 8.8% |
| e | 10741 | 8.7% |
| n | 8249 | 6.6% |
| o | 8034 | 6.5% |
| s | 7338 | 5.9% |
| r | 7053 | 5.7% |
| t | 5362 | 4.3% |
| (space) | 4659 | 3.8% |
| h | 4513 | 3.6% |
| Other values (46) | 44999 | 36.3% |
None
| Value | Count | Frequency (%) |
| á | 122 | 45.7% |
| é | 69 | 25.8% |
| ó | 37 | 13.9% |
| í | 24 | 9.0% |
| ô | 10 | 3.7% |
| ý | 3 | 1.1% |
| ö | 1 | 0.4% |
| š | 1 | 0.4% |
level2Gid
Text
Missing 
| Distinct | 973 |
|---|---|
| Distinct (%) | 7.5% |
| Missing | 5935 |
| Missing (%) | 31.5% |
| Memory size | 147.5 KiB |
Length
| Max length | 12 |
|---|---|
| Median length | 10 |
| Mean length | 10.09287758 |
| Min length | 7 |
Unique
| Unique | 358 ? |
|---|---|
| Unique (%) | 2.8% |
Sample
| 1st row | USA.7.5_1 |
|---|---|
| 2nd row | USA.7.5_1 |
| 3rd row | USA.39.9_1 |
| 4th row | USA.39.51_1 |
| 5th row | USA.39.9_1 |
| Value | Count | Frequency (%) |
| usa.30.2_1 | 2600 | 20.1% |
| usa.7.5_1 | 626 | 4.8% |
| usa.24.11_1 | 354 | 2.7% |
| usa.7.3_1 | 328 | 2.5% |
| usa.6.27_1 | 268 | 2.1% |
| idn.23.5_1 | 244 | 1.9% |
| usa.30.4_1 | 164 | 1.3% |
| egy.17.9_1 | 162 | 1.3% |
| usa.50.26_1 | 162 | 1.3% |
| usa.7.1_1 | 144 | 1.1% |
| Other values (963) | 7879 | 60.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 25827 | 19.8% |
| 1 | 18015 | 13.8% |
| _ | 12931 | 9.9% |
| A | 10186 | 7.8% |
| U | 9638 | 7.4% |
| S | 9412 | 7.2% |
| 2 | 8521 | 6.5% |
| 3 | 7867 | 6.0% |
| 0 | 4300 | 3.3% |
| 5 | 3455 | 2.6% |
| Other values (28) | 20359 | 15.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 52960 | 40.6% |
| Uppercase Letter | 38793 | 29.7% |
| Other Punctuation | 25827 | 19.8% |
| Connector Punctuation | 12931 | 9.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 10186 | 26.3% |
| U | 9638 | 24.8% |
| S | 9412 | 24.3% |
| E | 1706 | 4.4% |
| N | 1692 | 4.4% |
| C | 1133 | 2.9% |
| M | 752 | 1.9% |
| G | 632 | 1.6% |
| K | 627 | 1.6% |
| X | 510 | 1.3% |
| Other values (16) | 2505 | 6.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 18015 | 34.0% |
| 2 | 8521 | 16.1% |
| 3 | 7867 | 14.9% |
| 0 | 4300 | 8.1% |
| 5 | 3455 | 6.5% |
| 4 | 3300 | 6.2% |
| 7 | 3051 | 5.8% |
| 6 | 2166 | 4.1% |
| 9 | 1295 | 2.4% |
| 8 | 990 | 1.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 25827 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 12931 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 91718 | 70.3% |
| Latin | 38793 | 29.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 10186 | 26.3% |
| U | 9638 | 24.8% |
| S | 9412 | 24.3% |
| E | 1706 | 4.4% |
| N | 1692 | 4.4% |
| C | 1133 | 2.9% |
| M | 752 | 1.9% |
| G | 632 | 1.6% |
| K | 627 | 1.6% |
| X | 510 | 1.3% |
| Other values (16) | 2505 | 6.5% |
Common
| Value | Count | Frequency (%) |
| . | 25827 | 28.2% |
| 1 | 18015 | 19.6% |
| _ | 12931 | 14.1% |
| 2 | 8521 | 9.3% |
| 3 | 7867 | 8.6% |
| 0 | 4300 | 4.7% |
| 5 | 3455 | 3.8% |
| 4 | 3300 | 3.6% |
| 7 | 3051 | 3.3% |
| 6 | 2166 | 2.4% |
| Other values (2) | 2285 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 130511 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 25827 | 19.8% |
| 1 | 18015 | 13.8% |
| _ | 12931 | 9.9% |
| A | 10186 | 7.8% |
| U | 9638 | 7.4% |
| S | 9412 | 7.2% |
| 2 | 8521 | 6.5% |
| 3 | 7867 | 6.0% |
| 0 | 4300 | 3.3% |
| 5 | 3455 | 2.6% |
| Other values (28) | 20359 | 15.6% |
level2Name
Text
Missing 
| Distinct | 895 |
|---|---|
| Distinct (%) | 6.9% |
| Missing | 5935 |
| Missing (%) | 31.5% |
| Memory size | 147.5 KiB |
Length
| Max length | 31 |
|---|---|
| Median length | 27 |
| Mean length | 8.232000619 |
| Min length | 3 |
Unique
| Unique | 307 ? |
|---|---|
| Unique (%) | 2.4% |
Sample
| 1st row | New Haven |
|---|---|
| 2nd row | New Haven |
| 3rd row | Bucks |
| 4th row | Philadelphia |
| 5th row | Bucks |
| Value | Count | Frequency (%) |
| carroll | 2600 | 15.8% |
| new | 738 | 4.5% |
| haven | 626 | 3.8% |
| cass | 356 | 2.2% |
| litchfield | 328 | 2.0% |
| gunnison | 268 | 1.6% |
| dogiyai | 244 | 1.5% |
| north | 204 | 1.2% |
| aswan | 175 | 1.1% |
| no | 166 | 1.0% |
| Other values (988) | 10746 | 65.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11555 | 10.9% |
| r | 9803 | 9.2% |
| o | 9094 | 8.5% |
| l | 8669 | 8.1% |
| e | 7675 | 7.2% |
| n | 6688 | 6.3% |
| i | 6359 | 6.0% |
| s | 4313 | 4.1% |
| C | 3874 | 3.6% |
| (space) | 3520 | 3.3% |
| Other values (76) | 34898 | 32.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 85822 | 80.6% |
| Uppercase Letter | 16283 | 15.3% |
| Space Separator | 3520 | 3.3% |
| Decimal Number | 312 | 0.3% |
| Dash Punctuation | 303 | 0.3% |
| Other Punctuation | 208 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11555 | 13.5% |
| r | 9803 | 11.4% |
| o | 9094 | 10.6% |
| l | 8669 | 10.1% |
| e | 7675 | 8.9% |
| n | 6688 | 7.8% |
| i | 6359 | 7.4% |
| s | 4313 | 5.0% |
| t | 3151 | 3.7% |
| u | 2285 | 2.7% |
| Other values (31) | 16230 | 18.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3874 | 23.8% |
| N | 1417 | 8.7% |
| H | 1012 | 6.2% |
| S | 996 | 6.1% |
| L | 991 | 6.1% |
| M | 913 | 5.6% |
| G | 633 | 3.9% |
| A | 630 | 3.9% |
| F | 619 | 3.8% |
| B | 617 | 3.8% |
| Other values (20) | 4581 | 28.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 144 | 46.2% |
| 5 | 94 | 30.1% |
| 2 | 35 | 11.2% |
| 9 | 19 | 6.1% |
| 3 | 7 | 2.2% |
| 7 | 5 | 1.6% |
| 8 | 3 | 1.0% |
| 4 | 3 | 1.0% |
| 6 | 1 | 0.3% |
| 0 | 1 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 171 | 82.2% |
| ' | 36 | 17.3% |
| / | 1 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3520 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 303 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 102105 | 95.9% |
| Common | 4343 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11555 | 11.3% |
| r | 9803 | 9.6% |
| o | 9094 | 8.9% |
| l | 8669 | 8.5% |
| e | 7675 | 7.5% |
| n | 6688 | 6.6% |
| i | 6359 | 6.2% |
| s | 4313 | 4.2% |
| C | 3874 | 3.8% |
| t | 3151 | 3.1% |
| Other values (61) | 30924 | 30.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 3520 | 81.0% |
| - | 303 | 7.0% |
| . | 171 | 3.9% |
| 1 | 144 | 3.3% |
| 5 | 94 | 2.2% |
| ' | 36 | 0.8% |
| 2 | 35 | 0.8% |
| 9 | 19 | 0.4% |
| 3 | 7 | 0.2% |
| 7 | 5 | 0.1% |
| Other values (5) | 9 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 106155 | 99.7% |
| None | 292 | 0.3% |
| IPA Ext | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 11555 | 10.9% |
| r | 9803 | 9.2% |
| o | 9094 | 8.6% |
| l | 8669 | 8.2% |
| e | 7675 | 7.2% |
| n | 6688 | 6.3% |
| i | 6359 | 6.0% |
| s | 4313 | 4.1% |
| C | 3874 | 3.6% |
| (space) | 3520 | 3.3% |
| Other values (56) | 34605 | 32.6% |
None
| Value | Count | Frequency (%) |
| é | 92 | 31.5% |
| á | 52 | 17.8% |
| í | 37 | 12.7% |
| ú | 30 | 10.3% |
| ñ | 28 | 9.6% |
| ô | 11 | 3.8% |
| ó | 11 | 3.8% |
| ı | 6 | 2.1% |
| ö | 6 | 2.1% |
| ü | 6 | 2.1% |
| Other values (9) | 13 | 4.5% |
IPA Ext
| Value | Count | Frequency (%) |
| ə | 1 | 100.0% |
level3Gid
Text
Missing 
| Distinct | 353 |
|---|---|
| Distinct (%) | 15.2% |
| Missing | 16539 |
| Missing (%) | 87.7% |
| Memory size | 147.5 KiB |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 11.96648045 |
| Min length | 11 |
Unique
| Unique | 157 ? |
|---|---|
| Unique (%) | 6.7% |
Sample
| 1st row | KEN.14.3.2_1 |
|---|---|
| 2nd row | KEN.14.3.2_1 |
| 3rd row | KHM.14.1.4_2 |
| 4th row | KEN.36.1.2_1 |
| 5th row | IDN.23.5.4_1 |
| Value | Count | Frequency (%) |
| idn.23.5.4_1 | 244 | 10.5% |
| ken.14.3.2_1 | 90 | 3.9% |
| ken.33.6.4_1 | 82 | 3.5% |
| can.2.13.1_1 | 77 | 3.3% |
| ecu.16.5.7_1 | 76 | 3.3% |
| ken.36.1.2_1 | 57 | 2.4% |
| can.1.6.6_1 | 53 | 2.3% |
| ken.33.5.1_1 | 51 | 2.2% |
| ecu.16.5.5_1 | 50 | 2.1% |
| can.2.4.38_1 | 47 | 2.0% |
| Other values (343) | 1500 | 64.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 6981 | 25.1% |
| 1 | 4347 | 15.6% |
| _ | 2327 | 8.4% |
| 2 | 1710 | 6.1% |
| 3 | 1593 | 5.7% |
| N | 1578 | 5.7% |
| C | 1059 | 3.8% |
| 4 | 1020 | 3.7% |
| 5 | 1001 | 3.6% |
| E | 852 | 3.1% |
| Other values (25) | 5378 | 19.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11557 | 41.5% |
| Other Punctuation | 6981 | 25.1% |
| Uppercase Letter | 6981 | 25.1% |
| Connector Punctuation | 2327 | 8.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1578 | 22.6% |
| C | 1059 | 15.2% |
| E | 852 | 12.2% |
| A | 734 | 10.5% |
| K | 583 | 8.4% |
| D | 419 | 6.0% |
| I | 351 | 5.0% |
| R | 282 | 4.0% |
| G | 225 | 3.2% |
| U | 216 | 3.1% |
| Other values (13) | 682 | 9.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4347 | 37.6% |
| 2 | 1710 | 14.8% |
| 3 | 1593 | 13.8% |
| 4 | 1020 | 8.8% |
| 5 | 1001 | 8.7% |
| 6 | 780 | 6.7% |
| 8 | 333 | 2.9% |
| 7 | 312 | 2.7% |
| 9 | 274 | 2.4% |
| 0 | 187 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 6981 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2327 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 20865 | 74.9% |
| Latin | 6981 | 25.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1578 | 22.6% |
| C | 1059 | 15.2% |
| E | 852 | 12.2% |
| A | 734 | 10.5% |
| K | 583 | 8.4% |
| D | 419 | 6.0% |
| I | 351 | 5.0% |
| R | 282 | 4.0% |
| G | 225 | 3.2% |
| U | 216 | 3.1% |
| Other values (13) | 682 | 9.8% |
Common
| Value | Count | Frequency (%) |
| . | 6981 | 33.5% |
| 1 | 4347 | 20.8% |
| _ | 2327 | 11.2% |
| 2 | 1710 | 8.2% |
| 3 | 1593 | 7.6% |
| 4 | 1020 | 4.9% |
| 5 | 1001 | 4.8% |
| 6 | 780 | 3.7% |
| 8 | 333 | 1.6% |
| 7 | 312 | 1.5% |
| Other values (2) | 461 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27846 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 6981 | 25.1% |
| 1 | 4347 | 15.6% |
| _ | 2327 | 8.4% |
| 2 | 1710 | 6.1% |
| 3 | 1593 | 5.7% |
| N | 1578 | 5.7% |
| C | 1059 | 3.8% |
| 4 | 1020 | 3.7% |
| 5 | 1001 | 3.6% |
| E | 852 | 3.1% |
| Other values (25) | 5378 | 19.3% |
level3Name
Text
Missing 
| Distinct | 349 |
|---|---|
| Distinct (%) | 15.0% |
| Missing | 16544 |
| Missing (%) | 87.7% |
| Memory size | 147.5 KiB |
Length
| Max length | 32 |
|---|---|
| Median length | 26 |
| Mean length | 10.30275624 |
| Min length | 3 |
Unique
| Unique | 154 ? |
|---|---|
| Unique (%) | 6.6% |
Sample
| 1st row | Kibarani |
|---|---|
| 2nd row | Kibarani |
| 3rd row | Srae Khtum |
| 4th row | Gatarakwa |
| 5th row | Kamu Utara |
| Value | Count | Frequency (%) |
| utara | 245 | 6.6% |
| kamu | 244 | 6.5% |
| no | 93 | 2.5% |
| kibarani | 90 | 2.4% |
| siana | 82 | 2.2% |
| abbotsford | 77 | 2.1% |
| talag | 76 | 2.0% |
| kootenay | 63 | 1.7% |
| east | 63 | 1.7% |
| kumba | 61 | 1.6% |
| Other values (431) | 2644 | 70.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3597 | 15.0% |
| r | 1603 | 6.7% |
| i | 1549 | 6.5% |
| o | 1491 | 6.2% |
| t | 1439 | 6.0% |
| (space) | 1416 | 5.9% |
| n | 1275 | 5.3% |
| e | 1080 | 4.5% |
| u | 868 | 3.6% |
| m | 819 | 3.4% |
| Other values (67) | 8786 | 36.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18215 | 76.1% |
| Uppercase Letter | 3591 | 15.0% |
| Space Separator | 1416 | 5.9% |
| Other Punctuation | 267 | 1.1% |
| Decimal Number | 256 | 1.1% |
| Dash Punctuation | 72 | 0.3% |
| Open Punctuation | 54 | 0.2% |
| Close Punctuation | 52 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3597 | 19.7% |
| r | 1603 | 8.8% |
| i | 1549 | 8.5% |
| o | 1491 | 8.2% |
| t | 1439 | 7.9% |
| n | 1275 | 7.0% |
| e | 1080 | 5.9% |
| u | 868 | 4.8% |
| m | 819 | 4.5% |
| s | 643 | 3.5% |
| Other values (21) | 3851 | 21.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 653 | 18.2% |
| U | 292 | 8.1% |
| M | 258 | 7.2% |
| N | 256 | 7.1% |
| S | 248 | 6.9% |
| C | 245 | 6.8% |
| A | 224 | 6.2% |
| G | 177 | 4.9% |
| D | 152 | 4.2% |
| I | 139 | 3.9% |
| Other values (18) | 947 | 26.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 64 | 25.0% |
| 1 | 63 | 24.6% |
| 3 | 59 | 23.0% |
| 2 | 34 | 13.3% |
| 5 | 17 | 6.6% |
| 7 | 8 | 3.1% |
| 6 | 3 | 1.2% |
| 8 | 3 | 1.2% |
| 4 | 3 | 1.2% |
| 0 | 2 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 148 | 55.4% |
| , | 56 | 21.0% |
| / | 46 | 17.2% |
| ' | 17 | 6.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1416 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 72 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 54 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 52 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21806 | 91.2% |
| Common | 2117 | 8.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3597 | 16.5% |
| r | 1603 | 7.4% |
| i | 1549 | 7.1% |
| o | 1491 | 6.8% |
| t | 1439 | 6.6% |
| n | 1275 | 5.8% |
| e | 1080 | 5.0% |
| u | 868 | 4.0% |
| m | 819 | 3.8% |
| K | 653 | 3.0% |
| Other values (49) | 7432 | 34.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 1416 | 66.9% |
| . | 148 | 7.0% |
| - | 72 | 3.4% |
| 9 | 64 | 3.0% |
| 1 | 63 | 3.0% |
| 3 | 59 | 2.8% |
| , | 56 | 2.6% |
| ( | 54 | 2.6% |
| ) | 52 | 2.5% |
| / | 46 | 2.2% |
| Other values (8) | 87 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23866 | 99.8% |
| None | 57 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3597 | 15.1% |
| r | 1603 | 6.7% |
| i | 1549 | 6.5% |
| o | 1491 | 6.2% |
| t | 1439 | 6.0% |
| (space) | 1416 | 5.9% |
| n | 1275 | 5.3% |
| e | 1080 | 4.5% |
| u | 868 | 3.6% |
| m | 819 | 3.4% |
| Other values (60) | 8729 | 36.6% |
None
| Value | Count | Frequency (%) |
| é | 25 | 43.9% |
| ñ | 24 | 42.1% |
| í | 4 | 7.0% |
| Î | 1 | 1.8% |
| Ł | 1 | 1.8% |
| ń | 1 | 1.8% |
| ó | 1 | 1.8% |
Missing 
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 7581 |
| Missing (%) | 40.2% |
| Memory size | 147.5 KiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | LC |
|---|---|
| 2nd row | LC |
| 3rd row | LC |
| 4th row | LC |
| 5th row | LC |
| Value | Count | Frequency (%) |
| lc | 6500 | 57.6% |
| ne | 3477 | 30.8% |
| vu | 401 | 3.6% |
| en | 356 | 3.2% |
| nt | 314 | 2.8% |
| cr | 147 | 1.3% |
| dd | 85 | 0.8% |
| ex | 5 | < 0.1% |
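The Frequency (%) column can be reproduced from the counts alone. The sketch below assumes the denominator is the sum of the listed counts (11285, which equals the 18866 observations minus the 7581 missing values) and that shares below 0.1% are printed as "< 0.1%"; both assumptions are inferred from the figures shown here rather than confirmed by the report, and `frequency_table` is a hypothetical helper name.

```python
def frequency_table(counts):
    """Return (value, count, percentage label) rows sorted by descending count."""
    total = sum(counts.values())
    rows = []
    for value, count in sorted(counts.items(), key=lambda kv: -kv[1]):
        share = 100 * count / total
        # Assumed display rule: very small shares collapse to "< 0.1%".
        label = "< 0.1%" if share < 0.1 else f"{share:.1f}%"
        rows.append((value, count, label))
    return rows


# Counts copied from the table above.
counts = {
    "lc": 6500, "ne": 3477, "vu": 401, "en": 356,
    "nt": 314, "cr": 147, "dd": 85, "ex": 5,
}

for value, count, label in frequency_table(counts):
    print(f"| {value} | {count} | {label} |")
```

Under these assumptions the computed labels for `vu` through `ex` agree with the percentages printed in the table.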
Most occurring characters
| Value | Count | Frequency (%) |
| C | 6647 | 29.5% |
| L | 6500 | 28.8% |
| N | 4147 | 18.4% |
| E | 3838 | 17.0% |
| V | 401 | 1.8% |
| U | 401 | 1.8% |
| T | 314 | 1.4% |
| D | 170 | 0.8% |
| R | 147 | 0.7% |
| X | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 22570 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 6647 | 29.5% |
| L | 6500 | 28.8% |
| N | 4147 | 18.4% |
| E | 3838 | 17.0% |
| V | 401 | 1.8% |
| U | 401 | 1.8% |
| T | 314 | 1.4% |
| D | 170 | 0.8% |
| R | 147 | 0.7% |
| X | 5 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22570 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 6647 | 29.5% |
| L | 6500 | 28.8% |
| N | 4147 | 18.4% |
| E | 3838 | 17.0% |
| V | 401 | 1.8% |
| U | 401 | 1.8% |
| T | 314 | 1.4% |
| D | 170 | 0.8% |
| R | 147 | 0.7% |
| X | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22570 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 6647 | 29.5% |
| L | 6500 | 28.8% |
| N | 4147 | 18.4% |
| E | 3838 | 17.0% |
| V | 401 | 1.8% |
| U | 401 | 1.8% |
| T | 314 | 1.4% |
| D | 170 | 0.8% |
| R | 147 | 0.7% |
| X | 5 | < 0.1% |